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Preface 


Computer science is a very large subject, and graduates pursue a wide variety of 
careers, from programming to computer graphics. Consequently, it is impossible to 
write a mathematics book that caters for all of these career paths. Nevertheless, I 
have attempted to describe a range of mathematical topics that I believe are relevant, 
and have helped me during my own career in computer science. 

The book’s subtitle A Visual Approach reflects the importance I place on coloured 
illustrations and function graphs, of which there are over 280 and over 100 tables. 
Many chapters contain a variety of worked examples. 

I am very conscious of any mathematical errors or spelling mistakes you may 
discover. I have done my best! 

This fourth edition remains an introductory text and is aimed at students studying 
for an undergraduate degree in computer science. There are now 26 chapters and 
I have included new material on statistics, curves, equations, Riemann hypothesis, 
analytic geometry and Fourier analysis, which should provide readers with a solid 
foundation, upon which more advanced topics of mathematics can be studied. 

I have referenced the key people behind the various mathematical discoveries 
covered, which I hope adds a human dimension to the subject. I have found it very 
interesting and entertaining to discover how some mathematicians ridiculed their 
fellow peers when they could not comprehend the significance of a new invention. 

There is no way I could have written this book without the assistance of the 
Internet and my books previously published by Springer Verlag. In particular, I 
would like to acknowledge Wikipedia and Richard Elwes’ excellent book Maths 
1001. I prepared this book on an Apple iMac, using I4TBX2,, Pages and the Grapher 
package, and would recommend this combination to anyone considering writing a 
book on mathematics. I do hope you enjoy reading this book, and that you are tempted 
to study mathematics to a deeper level. Mathematics is a good companion. 


Breinton, Herefordshire, UK John Vince 
May 2024 
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Chapter 1 A) 
Visual Mathematics б 


11 Introduction 


This opening chapter addresses the author’s writing style. 


1.2 Visual Brains Versus Analytic Brains 


I consider myself a visual person, as pictures help me understand complex problems. 
I don’t find it too difficult to visualise objects from different view points. І remember 
learning about electrons, neutrons and protons for the first time, where our planetary 
system provided a simple model to visualise the hidden structure of matter. My 
mental image of electrons was one of small orange spheres, spinning around a small, 
central nucleus containing blue protons and grey neutrons. And although this model 
is seriously flawed, and has been replaced by various fields of energy, it provided the 
first step towards understanding the structure of matter. 

As my knowledge of mathematics grew, this, too, was image based. Equations 
were straight lines, curves or surfaces; simultaneous equations were intersecting or 
parallel lines, etc.; and when I embarked upon computer science, I found a natural 
application for mathematics. For me, mathematics is a visual science, although I 
do appreciate that many professional mathematicians need only a formal, symbolic 
notation for constructing their world. Such people do not require visual scaffolding- 
they seem to be able to manipulate abstract mathematical concepts at a symbolic level. 
Their books do not require illustrations or diagrams-Greek symbols, upside-down 
and back-to-front Latin fonts are sufficient to annotate their ideas. 

Today, when reading popular science books on quantum theory, I still try to form 
images of 3D fields of energy and probability oscillating in space-to no avail-and I 
have accepted that human knowledge of such phenomena is best left to a mathematical 
description. Nevertheless, mathematicians such as Sir Roger Penrose, know the 
importance of visual models in communicating complex mathematical ideas. His 


© Springer Nature Switzerland AG 2024 1 
J. Vince, Foundation Mathematics for Computer Science, 
https://doi.org/10.1007/978-3-031-66549-3 1 


2 1 Visual Mathematics 


book The Road to Reality (Penrose 2004), is decorated with beautiful, informative, 
hand-drawn illustrations, which help readers understand the mathematics of science. 
In this book I rely heavily on images to communicate an idea. They are simple and 
are the first step on a ladder towards understanding a difficult idea. Eventually, when 
that Eureka moment arrives, that moment when you say to yourself, ‘I understand 
what you are saying,’ the image becomes closely associated with the mathematical 
notation. 


1.3 Learning Mathematics 


I was fortunate in my studies in that I was taught by people interested in mathematics, 
and their interest rubbed off on me. I feel sorry for children who have given up 
on mathematics, simply because they are being taught by teachers whose primary 
subject is not mathematics. I was never too concerned about the uses of mathematics, 
although applied mathematics is of special interest. 

One of the problems with mathematics is its incredible breadth and depth. It 
embraces everything from 2D geometry, calculus, topology, statistics, complex func- 
tions to number theory and propositional calculus. All of these subjects can be stud- 
ied superficially or to a mind-numbing complexity. Fortunately, no one is required 
to understand everything, which is why mathematicians tend to specialise in one or 
two areas and develop a specialist knowledge. 


1.4 What Makes Mathematics Difficult? 


“What makes mathematics difficult?’ is a difficult question to answer, but one that 
has to be asked and answered. There are many answers to this question, and I believe 
that problems begin with mathematical notation and how to read it; how to analyse 
a problem and express a solution using mathematical statements. Unlike learning a 
foreign language—which I find very difficult-mathematics is a language that needs 
to be learned by discovering facts and building upon them to discover new facts. 
Consequently, a good memory is always an advantage, as well as a sense of logic. 
Mathematics can be difficult for anyone, including mathematicians. For example, 
when the idea of 4/—1 was originally proposed, it was criticised and looked down 
upon by mathematicians, mainly because its purpose was not fully understood. Even- 
tually, it transformed the entire mathematical landscape, including physics. Similarly, 
when the German mathematician Georg Cantor (1845-1919) published his papers 
on set theory and transfinite sets, some mathematicians hounded him in a disgraceful 
manner. The German mathematician Leopold Kronecker (1823-1891) called Cantor 
a ‘scientific charlatan’ a ‘renegade’ and a 'corrupter of youth’, and did everything 
to hinder Cantor’s academic career. Similarly, the French mathematician and physi- 
cist Henri Poincaré (1854-1912) called Cantor’s ideas a ‘grave disease’, whilst the 
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Austrian-British philosopher and logician Ludwig Wittgenstein (1889-1951) com- 
plained that mathematics is ‘ridden through and through with the pernicious idioms 
of set theory.’ How wrong they all were. Today, set theory is a major branch of math- 
ematics and has found its way into every math curriculum. So don’t be surprised to 
discover that some mathematical ideas are initially difficult to understand—you are 
in good company. 


1.5 Does Mathematics Exist Outside Our Brains? 


Many people have considered the question ‘What is mathematics?’ Some mathe- 
maticians and philosophers argue that numbers and mathematical formulae have 
some sort of external existence and are waiting to be discovered by us. Personally, 
I don’t accept this idea. I believe that we enjoy searching for patterns and structure 
in anything that finds its way into our brains, which is why we love poetry, music, 
storytelling, art, singing, architecture, science, as well as mathematics. The piano, 
for example, is an instrument for playing music using different patterns of notes. 
When the piano was invented—a few hundred years ago-the music of Chopin, Liszt 
and Rachmaninoff did not exist іп any form-it had to be composed by them. Sim- 
ilarly, by building a system for counting using numbers, we have an amazing tool 
for composing mathematical systems that help us measure quantity, structure, space 
and change. Such systems have been applied to topics such as fluid dynamics, opti- 
misation, statistics, cryptography, game theory probability theory, and many more. 
I will attempt to develop this same idea by showing how the concept of number, 
and the visual representation of number reveals all sorts of patterns, that give rise to 
number systems, algebra, trigonometry, geometry, analytic geometry and calculus. 
The universe does not need any of these mathematical ideas to run its machinery, but 
we need these ideas to understand its operation. 


1.6 Symbols and Notation 


One of the reasons why many people find mathematics inaccessible is due to its 
symbols and notation. Let’s look at symbols first. The English alphabet possesses a 
reasonable range of familiar character shapes: 


a,b,c,d,e,f,¢,h,i,j,k,],m,n,0,p,q,1,8,t,U,V,W,X,Y,Z 
A,B,C,D,E,EG,H,LJ,K,L,M,N,O,PQ,R,S,T,U, VW,X, Y,Z 


which find their way into every branch of mathematics and physics, and permit us 
to write equations such as: 
E = mc? 
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and 
А = лт? 


It is important that when we see an equation, we are able to read it as part of the text. 
In the case of E = mc?, this is read as ‘Е equals m, c squared', where E stands for 
energy, т for mass, and c the speed of light. In the case of A = zr’, this is read as ‘A 
equals pi, ғ squared’, where A stands for area, л the ratio of a circle's circumference 
to its diameter, and r the circle's radius. Greek symbols, which happen to look nice 
and impressive, have also found their way into many equations, and often disrupt the 
flow of reading, simply because we don't know their English names. For example, 
the English theoretical physicist Paul Dirac (1902-1984) derived an equation for a 
moving electron using the symbols о; and В, which are 4 x 4 matrices, where: 


o; В + Ва; = 0 
and is read as: 
‘the sum of the products alpha-i beta, and beta alpha-i, equals zero.’ 


Although we will not come across moving electrons in this book, we will have 
to be familiar with the following Greek symbols: 


a alpha v nu 

B beta Е хі 

у сатта о omicron 

6 delta л рі 

є epsilon p tho 

$ zeta o sigma 

n eta т tau 

0 theta v upsilon 

г iota $ phi 

к kappa X chi 

A lambda V psi 

ш mu @ omega 
and some upper-case symbols: 

Г Gamma У Sigma 

A Delta Y Upsilon 

© Theta Ф Phi 

А Lambda W Psi 

я Xi (2 Omega 

I Pi 


Being able to read an equation does not mean that we understand it—but we are a 
little closer than just being able to stare at a jumble of symbols! Therefore, in future, 
when I introduce a new mathematical object, I will tell you how it should be read. 
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17 Punctuation 


Ihave looked at books on mathematics and discovered that their authors have adopted 
a variety of punctuation styles, from ignoring all punctuation for equations, to includ- 
ing all punctuation such as colons, semi-colons, commas and full-stops. In this book, 
I have tried to adopt a middle road, by minimising the use of punctuation, unless 
necessary. I must admit that I have been influenced by ChatGPT, which advised: 
whichever style I chose, keep it consistent. 


Reference 


Penrose R (2004) The road to reality: a complete guide to the laws of the universe. Cape 


Chapter 2 R) 
Numbers Башы 


2.1 Introduction 


This chapter revises sets, and the sets of numbers employed іп mathematics such as 
natural, integer, rational, irrational, real, algebraic, transcendental, imaginary, com- 
plex, quaternions and octonions. It also describes how these numbers behave in the 
context of three laws: commutative, associative and the distributive law. 

As prime numbers find their way into all aspects of cryptography, and who knows 
what else, the chapter introduces the fundamental theorem of arithmetic, prime num- 
ber distribution, perfect numbers and Mersenne numbers. The chapter concludes with 
the concept of infinity and some worked examples. 


2.2 Sets 


In mathematics, a collection of elements or members is a set. For example, if S is 
the set containing the elements A, B, C, D, E, we can state that: 


AES 
Bes 
Ces 
Des 
EeS 


where € stands for ‘is a member of’ whereas: 
FES 
¢ stands for ‘is not a member of.’ 


© Springer Nature Switzerland AG 2024 7 
J. Vince, Foundation Mathematics for Computer Science, 
https://doi.org/10.1007/978-3-031-66549-3 2 


8 2 Numbers 


The set S is written 5 = (A, B, C, D, Е}, and to isolate the elements A, B, C, 
we can say that the set (A, B, C] is a subset of S. The theory of sets is covered 
later in Chap. 5. 


2.3 Counting 


To count the members of a set we perform a one-to-one correspondence {bijection}, 
with the subset of positive integers (1, 2, 3,..., п). Thus, if we reach 23 in the 
one-to-one correspondence with the positive integers, there are 23 members. This is 
counting in mathematics. 

Our parents normally teach us to count by first memorising the counting words 
one, two, three, four, five, six, seven, eight, nine, ten, etc., and second, associating 
them with our fingers, so that when asked to count the number of elements in a 
picture book, each element is associated with a counting word. When the process is 
complete, the number of elements equals the last counting word reached. However, 
this still assumes that we know the meaning of one, two, three, four, etc. Memorising 
these counting words is only part of the problem-getting them in the correct sequence 
is the real challenge! Later on in this chapter we discover alternative sets of positive 
integers to the base of 10 employed in every-day mathematics. 

A well-ordered set possesses a unique order, such as the set of natural numbers 
N. Therefore, if P is the well-ordered set of prime numbers and N is the well-ordered 
set of natural numbers, we can write: 


P = {2, 3, 5,7, 11, 13, 17, 19, 23, 29, 31, 37, 41, 43, 47, .. .] 
Қ = (1,2, 3,4, 5,6, 7, 8,9, 10, 11, 12, 13, 14, 15, 16, 17,...} 


By pairing the prime numbers in P with the numbers in N, we have: 
(2. 1}, {3,2}, {5, 3}, (7, 4), (11, 5), (13, 6), (17, 7), (19, 8), (23, 9], ... 


and we can reason that 2 is the 1st prime, and 3 is the 2nd prime, etc. However, we 
still have to declare what we mean by 1, 2, 3, 4, 5, etc., and without getting too 
philosophical, I like the idea of defining them as follows. The word one, represented 
by 1, stands for one-ness of anything: one finger, one house, one tree, one donkey, etc. 
The word two, represented by 2, is ‘one more than one.’ The word three, represented 
by 3, is *one more than two,' and so on. 

We are now in a position to associate some mathematical notation with our num- 
bers by introducing the + and = signs. We know that + means add, but it also can 
stand for more. We also know that — means equal, and it can also stand for is the 
same as. Thus the statement: 

2-1--1 


is read аз ‘two equals one more than one.’ 
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We can also write: 
3-1--2 


which is read as ‘three equals one more than two.’ But as we already have а definition 
for 2, we can write: 


3=1+2 
=1+1+1 


Developing this idea, and including some extra combinations, we have: 


2=1+1 

3=1+2 
4=1+3=2 +2 
5=1+4=2 +3 

6= 1+5=2 +4=3 +3 
7=1+6=2+5 =3 +4 


etc. 


and can be continued without limit. 


2.4 Zero 


The concept of zero has a well-documented history, which shows that it has been used 
by different cultures over a period of two-thousand years or more. It was the Indian 
mathematician and astronomer Brahmagupta (598-с.-670), who argued that zero 
was just as valid as any natural number, with the definition: the result of subtracting 
any number from itself. However, even today, there is no universal agreement as to 
whether zero belongs to the set N, consequently, the set N° stands for the set of 
natural numbers including zero. 

In today’s positional decimal system, which is a place value system, the digit 
0 is a placeholder. For example, 203 stands for: two hundreds, no tens and three 
units. Although 0 е №, it does have special properties that distinguish it from other 
members of the set, and Brahmagupta also gave rules showing this interaction. 

If x є №, then the following rules apply: 


addition x+0= x 
subtraction x —0 = x 
multiplication x x0 = Охх =0 
division 0/x = 0 
undefined division х/0 
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The expression 0/0 is called an indeterminate form, as it is possible to show 
that under different conditions, especially limiting conditions, it can equal anything. 
So for the moment, we will avoid using it until we cover calculus. 


2.5 Negative Numbers 


When negative numbers were first proposed, they were not accepted with open 
arms, as it was difficult to visualise —5 of something. For instance, if there are 
5 donkeys in a field, and they are all stolen, the field is now empty, and there is 
nothing we can do in the arithmetic of donkeys to create a field of —3 donkeys. But 
may be we should be using natural numbers to represent the number of donkeys. 
However, in applied mathematics, numbers have to represent all sorts of quantities 
such as temperature, displacement, angular rotation, speed, acceleration, etc., and 
we also need to incorporate ideas such as left and right, up and down, before and 
after, forwards and backwards, etc. Fortunately, negative numbers are perfect for 
representing all of the above quantities and ideas. 

Consider the expression 4 — x, where x € №. When x takes on certain values, 
we have: 


4—123 
4—2= 

4—3=1 
4-4-0 


and unless we introduce negative numbers, we are unable to express the result of 
4-5. Consequently, negative numbers are visualised as shown in Fig. 2.1, where the 
number line shows negative numbers to the left of the natural numbers, which are 
positive, although the + sign is omitted for clarity. 

Moving from left to right, the number line provides a numerical continuum from 
large negative numbers, through zero, towards large positive numbers. In any calcu- 
lation we could agree that angles above the horizon are positive, and angles below 
the horizon, negative. Similarly, a movement forwards is positive, and a movement 
backwards is negative. So now we are able to write: 


Fig. 2.1 Тһе number line Ce ГО d 3-73 Ltt © d 
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without worrying about creating impossible conditions. 


2.5.1 Тһе Arithmetic of Positive and Negative Numbers 


Once again, Brahmagupta compiled all the rules, Tables 2.1 and 2.2, supporting the 
addition, subtraction, multiplication and division of positive and negative numbers. 
The real fly in the ointment, being negative numbers, which cause problems for 
children, math teachers and occasional accidents for mathematicians. Perhaps, the 
one rule we all remember from our school days is that ‘two negatives make a positive.’ 
We can show that this is true by first assuming that two negatives make a negative: 


2х-3-(4-6)4-17) 
6 4 16 — 24 — 28 — 42 


which is nonsense. However, if we assume that two negatives make a positive: 


2x —3 = (4 — 6)(4 — 7) 
6 = 16 — 24 — 28 + 42 


which is true. Naturally, there is a formal way of proving this. 


Another problem with negative numbers arises when we employ the square-root 
function. As the product of two positive or negative numbers results in a positive 


Table 2.1 Rules for adding and subtracting positive and negative numbers 


Table 2.2 Rules for multiplying and dividing positive and negative numbers 
x b => / b = 
а ab |—ab| a a/b | —a/b 
—a | —ab | ab | —a | —a/b | a/b 
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result, the square-root of a positive number gives rise to a positive and a nega- 


tive answer. For example, 4/4 = +2. This means that the square-root function only 
applies to positive numbers. Nevertheless, it did not stop the invention of the imag- 
inary unit i, where i? = —1. However, i is not a number, but an operator, and is 


described later. 


2.6 Observations and Axioms 


The following axioms or laws provide a formal basis for mathematics, and in the 
descriptions a binary operation is an arithmetic operation such as: +, —, x, / 
which operate on two operands. 


2.6.1 Commutative Law 


The commutative law in algebra states that when two elements are linked through 
some binary operation, the result is independent of the order of the elements. 
The commutative law of addition is: 


at+b=b+a 
eg. 1+2=2+1 


The commutative law of multiplication is: 


axb=bxa 


eg.1x222x1 
Note that subtraction is not commutative: 


a—bzb-a 
eg. 1-242-1 


2.6.2 Associative Law 


The associative law in algebra states that when three or more elements are linked 
together through a binary operation, the result is independent of how each pair of 
elements is grouped. 
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The associative law of addition is: 


a+(b+c)=(a+b)+c 
е.о.1--(2--3)- (1+2) +3 


The associative law of multiplication is: 


ax(bxc)=(axb)xc 
eg. 1х (2х3) = (1х 2) x3 


However, note that subtraction is not associative: 


a — (b — c) %+ (а= Б) – с 
e.g. 1—(2—3) = (1-2) – 3 


Which may seem surprising, but at ће same time confirms the need for clear axioms. 


2.6.3 Distributive Law 


The distributive law in algebra describes an operation which when performed on a 
combination of elements is the same as performing the operation on the individual 
elements. The distributive law does not work in all cases of arithmetic. For example, 
multiplication over addition holds: 


a(b+c)=ab+ac 
eg.2(34-4) 26-48 


Whereas addition over multiplication does not: 


a+ (b x c) = (a+b) x (a+c) 
e.g.3 + (4х 5) Z (3+4) х (3 + 5) 


Although these laws are natural for numbers, they do not necessarily apply to all 
mathematical objects. For instance, the vector product, which multiplies two vectors 
together, is not generally commutative. The same applies for matrix multiplication. 


2.7 Types of Numbers 


As mathematics evolved, mathematicians introduced different types of numbers to 
help classify equations and simplify the language employed to describe their work. 
These are the various types and their set names. 
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2.71 Natural Numbers 


The natural numbers (1, 2, 3, 4, ...} are used for counting, ordering and labelling 
and represented by the set N. When zero is included, N° or No is used: 


N° = № = (0, 1, 2,...} 


Note that negative numbers are not included. Natural numbers are used to subscript 
a quantity to distinguish one element from another, e.g. xi, Xo, Xs, Ха, ... 

Very often, we are interested in whether a natural number is divisible by another 
number, without leaving a remainder. Here are some divisibility tests, some of which, 
are very obvious. 

Divisible by 2: if the number ends with 0, 2, 4, 6, or 8. 

This is because the number abc = 100a + 10b + с, where 100a and 10b are both 
divisible by 2, therefore, abc is divisible by 2, iff (1f and only if) c is divisible by 2, 
i.e. is 0, 2, 4, 6, 8. 

Divisible by 3: if the number's digits sum to a multiple of 3. For instance, 135 is 
divisible by 3, as 1 + 3 + 5 = 9, so too, is 16254, because 1 +6 +2 + 5 +4 = 18, 
and 1 + 8 = 9. This is because the number abc = 99a + 9b +a +b + c, where 
99а + 9b is divisible by 3, therefore, abc is divisible by 3, iff a + b + c is divisible 
by 3. 

Divisible by 4: if the number formed from the last two digits is divisible by 4. 
For instance, 1324 is divisible by 4, as 24 is divisible by 4. This is because the 
number abc = 100a + 10b + c, where 100a, and all higher numbers, are divisible 
by 4, therefore, abc is divisible by 4, iff 10b + c is divisible by 4. If the number is 
c, the test only applies to c. 

Divisible by 5: if the number ends with a 0 or 5. 

Divisible by 6: if the number passes the divisibility test for 2 and 3. For instance, 
1464/6=244, and 1464 is an even number and is divisible by 3, because 1 + 4 + 6 + 
4 = 15. 

Divisible by 7: There is no simple test for divisibility by 7, but here is a starting 
point. Starting with a three-digit number, isolate the right-hand digit, and double it. 
Subtract this result from the two-digit number formed from the left-most digits. If 
this result is divisible by 7, so is the original number. For instance, to test 231 we 
remove the 1, and double it, making 2. We then subtract 2 from 23, which makes 21, 
which is divisible by 7, and 231. This is because every number can be written 10a + b 
where b is the right-hand digit. This is isolated and doubled, 2b, leaving behind a. 
Subtracting 2b from a gives a — 2b. We double 10a + b making 20a + 2b, as this 
does not interfere with the original test, and 20a + 2b = 21a — (a — 2b), where 21a 
is divisible by 7. Therefore, 10a + b is divisible by 7, iff a — 2b is divisible by 7. 
Sometimes it is quicker to divide by 7! 

Divisible by 8: This is the same test as 4, but the last three digits are examined. For 
instance, 25136 is divisible by 8, (25136/8 — 3142) as 136/8 — 17. This is because 
the number abcd = 1000a + 1005 + 10c + d, where 1000a and higher digits are 
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divisible by 8, therefore, abcd is divisible by 8, iff 100b + 10c + d is divisible by 
8. As 10000 is divisible by 16, the technique is extended to four digits. 

Divisible by 9: This is the same test as 3. For instance, 123453 is divisible by 9, 
as 1+2+3+4+5+3 = 18 is divisible by 9. This is because the number abc = 
99а + 9b + а + b + c, where 99a + 9b is divisible by 9, therefore, abc is divisible 
by 9, iff a + + c is divisible by 9. 


2.7.2 Integer Numbers 


Integer numbers include the natural numbers, both positive and negative, and zero, 
and are represented by the set Z: 


По =2,—1, 0, 8.8. 


Z is used because the word integer in German is ganzen Zahlen. Leopold Kro- 
necker apparently criticised Georg Cantor for his work on set theory with the jibe: 
‘Die ganzen Zahlen hat der liebe Gott gemacht, alles andere ist Menschenwerk’ 
(Weber 1893). Which translates: “God made the integers, and all the rest is man’s 
work.’ implying that the rest are artificial. However, Cantor’s work on set theory and 
transfinite numbers proved to be far from artificial. 


2.7.3 Rational Numbers 


Any number that equals the quotient of one integer divided by another non-zero 
integer, is a rational number, and represented by the set О. For example, 2, 4/16, 
0.25 are rational numbers because: 


2 =4/2 
Ү16-4-8/2 
0.25 = 1/4 


Some rational numbers can be stored accurately inside a computer, but many others 
can only be stored approximately. For example, 4/3 produces an infinite sequence 
of threes, and written 1.3, and is truncated when stored as a binary number. 
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2.7.4 Irrational Numbers 


An irrational number cannot be expressed as the quotient of two integers. Irrational 
numbers never terminate, nor contain repeated sequences of digits, consequently, they 
are always subject to a small error when stored within a computer. Examples are: 


„2 = 1.41421356... (square-root of 2) 
ф = 1.61803398 ... (golden section) 
е = 2.71828182... (natural e) 

л = 3.14159265... (pi) 


2.7.5 Real Numbers 


Rational and irrational numbers comprise the set of real numbers R. Examples are 
1.5, 0.004, 12.999 and 23.0. We often use t є R to mean ‘t is a real number.’ 


2.7.6 Algebraic and Transcendental Numbers 


Polynomial equations with rational coefficients have the form: 
f(x) = ax" + bx"! pex"? 4...4€ 


such as: 
у = 3х2 +2х —1 


and their roots (solutions) belong to the set of algebraic numbers А. A consequence 
of this definition implies that all rational numbers are algebraic, since if: 


P 
x= 
q 


then: 
O=qx-p 


which is a polynomial. Numbers that are not roots to polynomial equations are 
transcendental numbers and include most irrational numbers, but not 4/2, since 


if: 
х =^/2 
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then: 
0-х-2 


which is а polynomial in x. 
Given a polynomial built from integers, for example: 


у = 3х3 — 4х? +х+ 23 


if the result is an integer, it is called an algebraic number, otherwise it is a tran- 
scendental number. Familiar examples of the latter being л = 3.141592653..., 
and e = 2.718281828..., which are represented the following continued fractions: 


4 
12 
32 
52 


2.7.7 Imaginary Numbers 


Imaginary numbers employ the symbol i to represent the impossible operation 
м —1. When combined with a real number they form a complex number which 
possesses vector-like properties. An imaginary number such as bi is defined as: 


beR, Р=-1 


Imaginary numbers obey all the axioms associated with real numbers: they can 
be added, subtracted, multiplied and divided. For example, given: 


18 


x = —6i 


у =3i 
therefore: 


x+y = –6і+ 3i= —3i 
Х-у--бі-3З1і--ді 
xy = —6i-3i = —18Р = 18 
x -бі 
m 
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Imaginary numbers appear to be an abstract idea created by mathematicians, just 
because it is possible. However, they play a very important role in mathematics, 
and are used to reveal patterns that would otherwise remain unnoticed. But apart 
from this, they are used in the design of electronic circuits, number theory, quantum 


mechanics, power distribution, and anything where waves are involved. 


2.7.8 Complex Numbers 


A complex number has a real and imaginary part: z = a + bi, and represented by 


the set C: 


т=а+ы, ЕС, a,beR, Di 


Examples аге: 


@= 1+1 
:1=3- 21 
z = —23 + м 231 


Complex numbers obey all the axioms associated with real numbers. For example, 


if we multiply a + bi by c + di we have: 


(a + bi)(c + di) = ac + adi + bci + bdi? 


Collecting up like terms and substituting —1 for i? we get: 


(a + bi)(c + di) = ac + (ad + bc)i — bd 
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which simplifies to: 
(a + bi)(c + di) = ac — bd + (ad + bc)i 


which is another complex number. 
For example, given: 
K= 2431 
у-3--4і 
Шеп: 
х+ у= (2+ 31) + (3+4) -5--71 
х= у= (2+ 31) – (3+4) = -1-1 
ху = (2 + 31)(3 + 4i) = 6 + 8i + 9i + 122 = —6 + 17i 
Something interesting happens when we multiply a complex number by its complex 
conjugate, which is the same complex number but with the sign of the imaginary 


part reversed: 
(a + bi)(a — bi) = a? — abi + abi — b’? 


Collecting up like terms and simplifying we obtain: 
(a + bi)(a — bi) 2 a? + b’ 
which is a real number, as the imaginary part is cancelled out by the action of the 


complex conjugate. Given a complex number y, its complex conjugate is represented 
by y. This permits us to divide one complex number by another as follows: 


х-2--3і 
у-3-41 
y=3-41 
x ху 
у yy 


_ 2+3) (3-4) 
T 344i) (3-44) 
6 — 8i 4- 91+ 12 
9+ 16 
18 +i 
~ 25 


18 |. 
= оу ast 


Chapter 16 provides more information on complex numbers. 
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2.7.9 Quaternions and Octonions 


In 1843, the brilliant Irish mathematician and physicist Sir William Rowan Hamil- 
ton (1805-1865), invented quaternions, represented by the set H, in honour of its 
inventor, and became the first non-commutative algebra: 


а=а+ы-+ с] +ак 


where: 
ЧЕН, а,Ь, с, deR Р=р=ю=-1 
y=k, ji=—k, jk=i, Кі--і, к=], ik=-j, yk=-1 


The imaginary products are shown in Table 2.3. 
Given two quaternions: 


x=a+bi+cj+dk 
y=e+ fi+gj+hk 


the product xy is: 


xy = (ae — bf — cg — dh) + (af + be + ch — dg) 
+ (ag + се+ df — bh)j+ (ah + de + bg — cf)k 


The American mathematician Josiah Willard Gibbs (1839-1903), realised that a 
quaternion’s imaginary part could be isolated and represent quantities with magnitude 
and direction, and 3D vectors were born: 


v = аі + bj + ck 


Almost immediately quaternions were invented, the hunt began for the next com- 
plex algebra, which was discovered simultaneously in 1843 by a colleague of Hamil- 
ton, John Thomas Graves (1806-1870), who called them octaves, and by the young 
English mathematician Arthur Cayley (1821-1895), who called them Cayley Num- 
bers: 


Table 2.3 The quaternion's imaginary products 


x ae 
i MaA 
к: 
Шоу |-i|-1 
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Fig. 2.2 The nested sets of 
numbers 


z=at+bi+ct+dk+ep+ fq+erths 


а, Б, с, а, е, f в, ЄК, Р =р= =p =g =r = 5 = -l 


They аге now called octonions, апа аге not only non-commutative, but non- 
associative, which means that in general, given three octonions A, B, C, then 
(АВ)С = A(BC). In 1898, the German mathematician Adolf Hurwitz (1859-1919), 
proved that there are only four algebras where it is possible to multiply and divide 
in the accepted sense: R, C, H, O. Figure 2.2 shows the sets of numbers diagram- 
matically. 

Quaternions are used to rotate points about a 3D axis, and if you wish to learn 
more about this subject refer to the author’s books (Vince 2018, 2021, 2022). 


2.8 Prime Numbers 


A prime number is defined as a natural number greater than | that is not a product 
of two smaller natural numbers The first five prime numbers are 2, 3, 5, 7, 11. We 
can prove that any positive integer must either be a prime, or the product of two or 
more primes, using the following reasoning: 

The set of natural numbers comprises two sets: primes and non-primes. A prime, 
by definition, has no factors, apart from 1 and itself. A non-prime has factors and is 
called composite. However, these factors are natural numbers, which must either be 
prime or non-prime. Eventually, the composite factors must decompose into com- 
posite primes. 

For example, 72 = 8 x 9, but 8 = 2? and 9 = 32, therefore, 72 = 2? х 32. Even 
starting with 72 = 6 x 12, but 6 = 2 x 3 and 12 = 2? x 3, therefore, 72 = 2? x 32. 
Table 2.4 shows the prime factors for the first 30 numbers. 
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Table 2.4 The prime factors for the first 30 numbers 


Number | Factors | Number | Factors | Number | Factors 
1 11 11 21 3x7 
2 12 22х3 22 2x11 
3 3 13 13 23 23 
4 2p 14 2x7 24 3 x3 
5 5 15 3x5 25 өр 
6 2х3 16 Pu 26 2x 13 
q 7 17 17 27 33 
8 2 18 2х32 28 227 
9 32 19 19 29 29 
10 2х5 20 |22x5| 30 |2x3x5 


2.8.1 Тһе Fundamental Theorem of Arithmetic 


Original work by the Greek mathematician Euclid (Mid-4th to mid-3rd century BC), 
revealed the Fundamental Theorem of Arithmetic (FTAr), also called the Unique 
Factorisation Theorem, which states that every integer greater than 1, is either 
prime or the unique product of primes, and is expressed symbolically as follows. Let 


Pi, P2, Рз,..., рк be prime numbers, and o, оо, 05,..., o be their associated 
positive integer powers: рү', p5^, рз’,..., py‘. We now use the product function 
П (Pi) to create the product: рү! ру? рз°,..., рр“, and introduce the variable i with 


a range of 1 to k, which permits the FTAr to be written as: 
п = př pž př... p =| [př 


where П is shorthand for ‘multiply together the associated terms.’ 

For example, 2250 equals the unique product: 213253, and 245 = 5172. To prove 
that these prime products are unique, let’s first assume that they are not, and show 
that this leads to a contradiction. 

Let n > 1 and equals the product of two prime numbers: п = pı p2. Now let's 
assume that n also equals the product of two other prime numbers: 4142. Therefore: 


р1р2 = 4142 


апа 
_ 4142 


P2 


рі 
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which implies that either 41/р> or q2/p2 factorises. However, this is impossible as 
41 and 42 are prime, therefore, the original assumption was incorrect. The same 
reasoning may be generalised to any number of prime factors. 


2.8.2 Isla Prime? 


You may notice in Table 2.4 showing factors that | is not a prime number, which 
has not always been the case. The reason is due to maintaining the logical integrity 
of the FTAr, which emphasises the uniqueness of the product of primes. If 1 was a 
prime, we would have the following non-unique products: 


24 225 х3 
24 = 1х 23 x3 


and it doesn’t seem satisfying to make 1 a prime number, and then qualify the FTAr 
with the rider: ‘This only applies for primes greater than 1.’ 


2.8.3 The Goldbach Conjecture 


In 1742, the German mathematician Christian Goldbach (1690-1764), conjectured 
that every even, positive integer greater than 2 could be written as the sum of two 
primes: 


4-2--2 

14 = 11--3 

18 = 11-7 

20 = 13 +7 
ес. 


No one has ever found an exception to this conjecture, and no one has ever 
confirmed it. If you want to know more about this topic then consider 
https://en. wikipedia.org/wiki/Goldbach%27s_conjecture. 


2.8.4 Prime Number Distribution 


As one moves higher through the set of natural numbers, new primes are uncovered. 
But every prime discovered increases the possibility for more composite numbers, 
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Table 2.5 Examples of primes and prime factors 


ШЕБІ EA S | AGP) ЭЗ OUEST N 

2 3 

21% 

2 |5 31 

2131517 211 

КУУ ү 2,311 

о ЭИ 30, 031 = 59 х 509 
2131517| 11) 13)) 17 10 Sill = 19 53970-0917 
ШІЗІБІ Т ШІГЕІНДІ ІМ GL GRIS GIL к= SER 9% 2), G5) 
PEM ESSET n GITE ESTE ТЕ 29 223, 092, 871 — 317 x 703, 763 
2131517 ТІ 13 17 101911231129 6, 469, 693, 231 = 331 x 571 x 34, 231 
СІЗ ИТТЕ ГІ 1922 091 31 200,560,490,131 


which overall, creates a falling distribution for primes. Table 2.5 shows that there аге 
10 primes in the first 30 numbers, and further analysis reveals 25 primes in the first 
100 numbers, after which, they slowly decline, but never disappear. 

The German mathematician Carl Gauss (1777-1855), proved, at the age of four- 
teen, that as x — oo, (‘x moves towards infinity’), the function z (x), which estimates 
the number of primes up to x, is given by: 


х 
л(х) ~ —— 
In x 
where ~ stands for “similar to’. 
Testing this for x — 100: 


100 100 


ж А 22 
ш 100 4.60517 


л (100) ~ 


which is lower than the actual value of 25. However, the French mathematician 
Adrien-Marie Legendre (1752-1833), conjectured the following relationship: 


X 


i Inx —B 


where B = 1.08366. But it appears that the best result is when B = 1. Testing this 
for x = 100: 


100 100 


100) ~ А А 
ОИ” 855 


28 


which is higher than the actual value of 25. 
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2.8.5 Infinity of Primes 


Euclid also showed that there are an infinite number of primes. As we know that the 
number of primes is either finite or infinite, we begin by assuming that the number 
is finite, and proving that the assumption is contradicted by an example. We begin 
by declaring that there are n primes: pi, po. рз, ···, pai, Pn. Next, we form 
the operation N = рух р X pa X ... X р„—у ри + 1, which can also be written 
using the product operation: 


М-Пр- 
i=l 


Now, N must be prime or have factors: 


1: If N is prime, then p, is not the largest prime, as assumed. 

2: If N has factors, it must be divisible by some prime factor. But this prime factor 
cannot include any of the original primes as there is a remainder of 1. Therefore, pn 
is not the largest prime, as assumed. Either way, the original assumption is incorrect; 
therefore, there must be an infinite number of primes. Table 2.5 lists some examples 
of these primes and prime factors. 

See (https://compoasso.free.fr) for an amazing list of prime numbers and related 
features. Also, readers interested in learning more about prime numbers should inves- 
tigate (Crandall and Pomerance 2005). 


2.8.6 Mersenne Numbers 


Numbers of the form 2^ — 1 are called Mersenne numbers, some of which, are also 
prime. The French theologian and mathematician Marin Mersenne (1588—1648), 
became interested in them towards the end of his life, and today they are known as 
Mersenne primes. 

By the end of the 16th-century, the highest Mersenne prime was 524,287 which 
equals 2!° — 1. At the start of the 21st-century, 243,112,609 1 was the highest, con- 
taining approximately 13 million digits! 

Apart from the fact that prime numbers are so mysterious, they are very important 
in public key cryptography, which is central to today’s Internet security systems. 


2.9 Perfect Numbers 


A perfect number equals the sum of its factors. For example, the factors of 6 are 1, 2 
and 3, whose sum is also 6. One would imagine that there would be a large quantity of 
small perfect numbers, but the first five are: 6, 28, 496, 8,128 and 33,550,336, which 
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are all even. And as the search continues to discover higher values, using computers, 
no odd perfect number has emerged. Euclid proved that if m is prime, and of the 
form 2 — 1, then m(m + 1)/2 is an even perfect number. For example, 3 is prime 
and: 


3x4 
3=22—1 ad ug 
2 
Similarly, 7 is prime and: 
7х8 
7=2 —1 and 2 = 28 


2.10 Triangular Numbers 


Triangular numbers are best visualised as shown in Fig. 2.3, where successive 
triangular numbers are constructed from equilateral triangles of circles. Thus, the 
sequence is 1, 3, 6, 10, 15, ..., oo. 

This sequence is generated by n (n + 1)/2, for the nth triangular number. 

The same numbers are found in the binomial theorem (5), covered in Chaps. 4 
and 6, where it is shown that: 


n n! 

= —————, Пп, КЕМ, и> 
k к 
2. 2х1 
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Fig. 2.3 First five triangular 
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2.11 Highest Common Factor 


The highest common factor (hcf) of two natural numbers is the largest natural 
number that divides into both numbers. For example, the hcf of 42 and 28 is 14, as 
42 = 2 x 3 x 7 and 28 = 2 x 2 х 7. A technique for finding the hcf is to represent 
each number as the product of prime numbers, and then find the string of prime 
products common to both numbers. For example, to find the hcf of 770 and 165 is to 
represent them as 770 = 2 x 5 x 7 x 11 and 165 = 3 x 5 x 11, therefore, the ВсЁ 
is5x 11 =55. 


2.12 Lowest Common Multiple 


The lowest common multiple (lcm) or lowest common denominator of two natural 
numbers is the smallest natural number into which they both divide. For example, 
the lcm of 18 and 24 is 72, because 18 = 2 x 3 x 3 and 24 = 2 x 2 x 2 x 3, and 
18 x 24/6 — 72. The technique used, is to multiply the two numbers in question, 
and divide by their hcf. 


2.3 Infinity 


Infinity is used to describe the size of unbounded systems. For example, prime 
numbers are infinite, so, too, are the sets of other numbers. Consequently, no matter 
how we try, itis impossible to visualise the size of infinity. Nevertheless, this did not 
stop Georg Cantor from showing that one infinite set could be infinitely larger than 
another. 

Cantor distinguished between those infinite number sets that could be counted, 
and those that could not. For Cantor, counting meant the one-to-one correspondence 
of a natural number with the members of another infinite set. If there is a clear 
correspondence, without leaving any gaps, then the two sets shared a common infinite 
size, called its cardinality using the first letter of the Hebrew alphabet aleph: N. The 
cardinality of the natural numbers N is No, called aleph-zero. 

Cantor discovered a way of representing the rational numbers as a grid, which 
is traversed diagonally, back and forth, as shown in Fig. 2.4. Some ratios appear 
several times, such as 2, 3, etc., which are not counted. Nevertheless, the one-to- 
one correspondence with the natural numbers means that the cardinality of rational 
numbers is also Xo. 

A real surprise was that there are infinitely more transcendental numbers than 
natural numbers. Furthermore, there are an infinite number of cardinalities rising to 
Ny. Cantor had been alone working in this esoteric area, and as he published his 
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Fig. 2.4 Rational number 1 1 1 1 1 
grid frets [4 |5 [77 
2 2 7T 2 x 2 2 “ 
1 2 4 5 i 
| 
{ 3 3 г 3 3 “ 3 
1 2 3 4 3 
4 ra 4 7T 4 " 4 4 
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| 5 5 zf 5 3 5 
1 2 3 4 5 
af 


results, he shook the very foundations of mathematics, which is why he was treated 
so badly by his fellow mathematicians. 


2.14 Worked Examples 


2.14.1 Algebraic Expansion 


Expand (а + b)(c + d), (a — b) (c + d), and (a — b) (с — d). 
Solution 


(a+ b)(c +4) = a(c -- d) -- b(c + d) 
= ac 4- ad + bc + bd 
(a — b)(c + d) = a(c + d) — b(c + d) 
= ac 4- ad — bc — bd 
(a — b)(c — d) = a(c — d) — b(c — d) 
— ac — ad — bc 4- bd 


2.14.2 Complex Numbers 


Compute (3 + 2i) + (2 + 2i) + (5 — 3i) and (3 + 23) 2 + 2i) (5 — 3i). 
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Solution 


(3 + 21) + (2+ 21) + (5 — 31) = 10 +1 


(3 + 2i)(2 + 21)(5 — 3i) = (3 + 21) (10 — 6i + 101+ 6) 
= (3 + 2i)(16 + 4i) 
= 48 + 12i + 32i — 8 
— 40 4- 44i 


2.14.3 Highest Common Factor 


Find the hcf of 760 and 330. 
Solution 


The hcf of 760 and 330 is to represent them as their prime products 760 = 2 x 2 x 
2x5 х 19 and 330 = 2 x 3 x 5 11, therefore, the hcf is 2 x 5 = 10. 


2.14.4 Lowest Common Multiple 


Find the lcm of 36 and 96. 
Solution 


The Іст of 36 апа 96 is 288, because 36 = 2 x 2 х 3x3 and 96—2x2x2x 
2 x 2 х 3, and 36 x 96/12 = 288. 


2.14.5 Оиаіегпіопѕ 


Compute (2 + 3i + 4j + k) + (6 + 2i +j + 2k) and (2 + 3i + 4j + К)(6+ 2i +j + 
2k). 


Solution 


(2 -- 3i 4j +k) + (64 2i - j 2k) = 8 + 5i + 5j + ЗК 
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(2+ 3i J- 4j +k)(6 + 2i+j + 2k) = 12 + 4i + 2j + 4k + 18i + 6i? + 3ij + бік 
+ 24] + 8ji + 4j? + 8jk + 6k + 2ki + kj + 2k? 
= 12 + 4i + 2j + 4k + 18i — 6 + 3k — 6j 
+ 24} — 8k — 4+ 81+ 6k + 2j —i— 2 
= 0 + 29i + 22j + 5k 
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Chapter 3 R) 
Systems of Counting узы 


3.1 Introduction 


This chapter covers four systems of representing numbers: the decimal positional, 
binary, octal and hexadecimal. The reader is probably very familiar the first system, 
but the last three may be new, and worthy of study. Finally, the chapter concludes 
with some worked examples. 


3.2 Decimal Positional System 
3.2.1 Background 


Over recent millennia, mankind has invented and discarded many systems for count- 
ing. People have counted on their fingers and toes, used pictures (hieroglyphics), cut 
marks on clay tablets (cuneiform symbols), employed Greek symbols (Ionic system) 
and struggled with, and abandoned Roman numerals (I, V, X, L, C, D, M, etc.), until 
we reach today's decimal positional system, which has Hindu-Arabic and Chinese 
origins. Let's investigate the algebra behind this system and see how it can be used 
to represent numbers to any base. 

The key to any positional counting system is the base employed. In the case of 
the decimal system, the base or radix is 10. 

The expression: 

A31, 000 + 45100 + 4110 + Aol (3.1) 


where Аз, Az, 21, Ao take on any value between 0 and 9, describes a whole number 
between 0 and 9, 999. By appending (3.2) to (3.1): 


A_10.1 + A 20.01 + A_30.001 (3.2) 


© Springer Nature Switzerland AG 2024 31 
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where 2-1, A_2, Аз take on any value between 0 and 9, any decimal number 
between 0 and 9, 999.999 can be represented. 
Indices bring this notation alive and reveal the true underlying pattern: 


23103 + А102 + 4110! + Ap 10° + A_; 107! + 115107? + A_3107? (3.3) 


Remember that any number raised to the power 0 equals 1. By adding extra terms 
both left and right in (3.3), any decimal number can be accommodated. 

In (3.3) 10 is the base, which means that the values of Аз to A_3 range between 0 
and 9, 1 less than the base. Therefore, by substituting В for the base we have: 


ДАНАЛЫҚ Ы SUE бан Ace AaB (3.4) 


where the values of A3 to A_3 range between 0 and В- 1. 


3.2.2 Binary Numbers 


Using (3.4), the binary number system has В = 2, and Аз to A_3 are 0 or 1: 
A323 + A22? + А121 + №020 + A27 + A927? +А 3273 ee 
The first 13 binary integers are: 
15, 102, 112, 1002, 1015, 1102, 1112, 10002, 10012, 10102, 10112, 11002, 1101; 
Thus 10011.11 is converted to decimal as follows: 


(1 x 24) + (0 x 2°) + (0 x 22) + (1 x 2) + (1 x 2°) + (1x 27) + (1x 27) 
(1х 16) + (0х 8) + (0х 4) + (1х2) 4- (1x 1) + (1х 0.5) + (1 x 0.25) 
(16 4- 2 + 1) + (0.5 + 0.25) 

19.75 


The reason why computers work with binary numbers-rather than decimal-is due to 
the difficulty of making electrical circuits that can store decimal numbers in a stable 
fashion. A switch, where the open state represents 0, and the closed state represents 
1, is the simplest electrical component to emulate. No matter how often it is used, or 
how old it becomes, it will generally behave like a switch. The main advantage of 
electrical circuits is that they can be switched on and off trillions of times a second, 
and the only disadvantage is that the encoded binary numbers and characters contain 
a large number of bits, and humans are not familiar with binary. 
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3.2.3 Octal Numbers 


Using (3.4), the octal number system has B = 8, and A3 to A_3 range between 0 
and 7: 


+++ + A387 238? + 2181 + A08? + A-187! A198 7 A387 +... 
The first 17 octal integers are: 

1g, 28, 3g, 48, 58, бз, 7g, 10g, 118, 128, 13g, 148, 158, 16g, 178, 20g, 218 
The subscript 8 reminds us that although we may continue to use the words ‘twenty- 
five’, it is an octal number, and not a decimal. But what is 25g in decimal? Well, it 


stands for: 
2х8! + 5х 80 = 21 


Thus 245.4 is converted to decimal as follows: 


(2x 8) + (4 x 8) + (5х 80) + (4х 87!) 
(2х 64) + (4 x 8) + (5 x 1) + (4 x 0.125) 
(128 + 32 + 5) + (0.5) 

165.5 


Counting in octal appears difficult, simply because we have never been exposed to 


it, like the decimal system. If we had evolved with 8 fingers, instead of 10, we would 
be counting in octal! 


3.2.4 Hexadecimal Numbers 


Using (3.4), the hexadecimal number system has В = 16, and Аз to A_3 can be 0 
to 15, which presents a slight problem, as we don’t have 15 different numerical char- 
acters. Consequently, we use 0 to 9, and the letters A, B, С, D, Е, Е to represent 
10, 11, 12, 13, 14, 15 respectively: 

+ А316? + 2316? + А116! + A916? + A-1167! + 11216 ? + 11316 2 +- 


The first 17 hexadecimal integers are: 


116, 216. 316. 416. 516, 616, 716, 816, 916, A16, B16, С16, D16, E16, Fie, 1016, 1116 
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Thus 2Е.816 is converted to decimal as follows: 
(2 x 16) + (E x 1) + (8 x 167!) 


(32 + 14) + (8/16) 
46.5 


3.3 Converting Decimal to Binary, Octal and Hexadecimal 
3.3.1 Converting Decimal to Binary 


To convert a decimal number to binary we can either work out the collection of 


1s and Os by reasoning, or employ an algorithm. Let's illustrate this by finding the 
binary equivalent of decimal 54. Using the first technique, we arrive at the result 
110110», because: 


1x2232 
1х 24 = 16 
0х 23 =0 
1x2 =4 
1х 21 =2 
0х 20 =0 


110110; = 54 = (32+ 16+4+2) 


Using an algorithm, we divide 54 by 2, and then repeatedly divide the previous result 
by 2: 


= 27 remainder 0 
= 13 remainder 1 
= б remainder 1 
= 3 remainder 0 


= 1 remainder 1 


кю юрокюрву dl oN Sv. @ 


= 0 remainder 1 


The remainders in reverse order provide the result, 1101102. 
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Let’s do the same for decimal 115: 


1х 2° = 64 
1х 25 = 32 
1х 24 = 16 
0х 23 =0 
0х 22 =0 
1x2! 22 
1х 20 =1 


11100115 = 115 = (64 + 32 + 16+2-+0 


Using the same algorithm: 


115 : 
= 57 remainder 1 
57 . 
5 = 28 remainder 1 
28 : 
>: = 14 геташаег 0 
14 : 
>= 7 remainder 0 
7 . 
та 3 remainder 1 
3 А 
5 = 1 геташаег 1 
1 : 
2 = 0 remainder 1 


The remainders in reverse order provide the result, 11100115. 


3.3.2 Converting Decimal to Octal 


To convert a decimal number to octal we can either work out the collection of 
digits by reasoning, or employ an algorithm. Let’s illustrate this by finding the octal 
equivalent of decimal 97. Using the first technique, we arrive at the result 1418, 
because: 


1х 82 = 64 
4х 8! = 32 
1х 8° = 1 


1418 = 97 = (64+ 32 + 1) 
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Using an algorithm, we divide 97 by 8, and then repeatedly divide the previous result 
by 8: 


7 
* = 12 remainder 1 
12 . 
r3 — ] remainder 4 
1 Я 
8 = 0 remainder 1 


The remainders in reverse order provide the result, 1418. 
Let’s do the same for decimal 1,235: 


2х 83 = 1, 024 
3 x 8? = 192 
2х8! = 16 
3х 89 =3 


2323; = 1,235 = (1, 024 + 192 + 16 + 3) 


Using the same algorithm: 


1,2 
= = 154 remainder 3 
154 қ 
A = 19 remainder 2 
19 . 
* — 2 remainder 3 
2 ' 
8 = 0 remainder 2 


The remainders in reverse order provide the result, 23233. 


3.3.3 Converting Decimal to Hexadecimal 


To convert a decimal number to hexadecimal we can either work out the collection 
of digits by reasoning, or employ an algorithm. Let’s illustrate this by finding the 
hexadecimal equivalent of decimal 427. Using the first technique, we arrive at the 
result 1 A B16, because: 
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1 x 16? = 256 
A x 16! = 160 
Вх 16°=11 


1АВ = 427 = (256 + 160 + 11) 


Using an algorithm, we divide 427 by 16, and then repeatedly divide the previous 
result by 16: 


427 . 
16 = 26 remainder 11 = В 


26 ; 
16 = 1 remainder 10 = A 


1 
— = O remainder 1 
16 


The remainders in reverse order provide the result, 1 A B16. 
Let's do the same for decimal 9,131: 


2x 163 = $, 192 


3 x 16? = 768 
Ах 16! = 160 
Вх 16° = 11 


23АВ = 9, 131 = (8, 192 + 768 + 160 + 11) 


Using the same algorithm: 


9,131 | 
16 = 570 remainder 11 = B 
570 | 
16 = 35 remainder 10 = А 
35 
-- = 2 remainder 3 
16 
-- = 0 remainder 2 
16 


The remainders in reverse order provide the result, 23A Big. 


38 3 Systems of Counting 


3.4 Converting Between Binary and Octal Numbers 


Although it is not obvious, binary, octal and hexadecimal numbers are closely related, 
which is why they are part of a programmer’s toolkit. Even though computers work 
with binary, it’s the last thing a programmer wants to use. So to simplify the man- 
machine interface, binary is converted into octal or hexadecimal. To illustrate this, 
let's convert the 16-bit binary code 11010110001100015 into octal. 

Using the following general binary integer: 


2828 + А727 + А620 + А52? + А42 + А32? + А222 + 2121 + А020 
we group the terms into threes, as 23 = 8, starting from the right: 
(A329 + A727 + 2628) + (A52° + 424 + А32?) + (452? + 242! + №029) 
Simplifying: 


2° (822? + A42! + 620) + 23 (A52? + 442! + А320) + 29 (452? + A12! + A92?) 
8? (A42? + A72! + Ав) +8' (А522 + 442! + Аз) + 8° (452° + 2121 + Ao) 
8R+8'S + 8°T 


where: 


К = А2? + A12! + А6 
S = А52? + 2421 + Аз 
T = 152? +4121 + do 
and the values of R, S, T vary between 0 and 7. Therefore, given 


11010110001100012, we divide the binary code into groups of three, starting at 
the right, and adding two leading zeros: 


(001)(101)(011)(000)(110)(001)2 
For each group, multiply the zeros and ones by 4, 2, 1, right to left: 


0+0+1)44+0+4+ 104+2+10+04+0)4+2+4+0)0+0+4 1) 
(1)(5) 8)(0) (6) (1) 
1530618 


Therefore, 11010110001100015 = 1530618, which is much more compact. Тһе 
secret of this technique is to memorise the following patterns: 
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0002 = 0 
0015 = 1; 
0105 = 2% 
011, = 3g 
1002 = 4g 
1015 = 5g 
1105 = бұ 
1115 = 7g 


Here are a few more examples, with the binary digits grouped in threes: 


1115 = 7 
101101; = 558 
100000; = 40g 
111000111000 111; = 707078 


It’s just as easy to reverse the process, and convert octal into binary. Here are some 
examples: 


5678 = 1011101115 
238 = 0100115 
17418 = 001111100001; 


3.5 Converting Between Binary and Hexadecimal Numbers 


A similar technique is used to convert binary to hexadecimal, but this time we 
divide the binary code into groups of four, as 2+ = 16, starting at the right, and 
adding leading zeros, if necessary. To illustrate this, let’s convert the 16-bit binary 
code 11010110001100015 into hexadecimal. 

Using the following general binary integer number: 


142! + А626 + A522 + Ag2* + A422 + A322 + эл2! + A92? (3.5) 
From the right, we divide the binary code into groups of four: 


(A727 + A626 + А52? + 542°) + (А32? + А222 + А12! + A92?) 
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Simplifying: 


24 (A72? + А622 + A52! + А420) + 2° (А32? + А222 + 442! + А020) 
16! (А72? + А622 + А52 + Ag) + 160 (A327 + 252? + 242! + Ao) 
16'S + 16?T (3.6) 


where: 


а 
Tei 437-4) 39 


and the values of S and T vary between 0 and 15. 
By including a further group of four terms to (3.5): 


Aid Fa Aaa pA HA2 
we introduce a third term 16? А to (3.6): 
16? К + 16'S + 16?T 
where: 
R = 51125 + A492? + A92! + Ag 
S = A723 + А622 + 2521 + А4 
T = 5325 + А22 +112! + А 
and the values of R, S and T vary between 0 and 15. 
Therefore, given 11010110001100015, we divide it into groups of four, starting 


at the right: 
(1101)(0110)(0011)(0001)2 


For each group, multiply the zeros and ones by 8, 4, 2, 1, right to left, respectively: 


(8+4+0+1)04+4+2+0)0+04+2+1)0+0+0+4 1) 
(13)(6)(3)(1) 
063116 


Therefore, 11010110001100015 = 06316, which is even more compact than its 
octal value 1530615. 

Ihave deliberately used whole numbers in the above examples, but they can all be 
extended to include a fractional part. For example, when converting a binary number 
such as 11.1101» to octal, the groups are formed about the binary point: 


(011).(110)(100)2 = 3.648 
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Table 3.1 Тһе first twenty decimal, binary, octal, and hexadecimal integers 


Decimal | Binary | Octal | Hex Decimal | Binary Octal | Hex 
1 1 1 1 11 1011 13 | B 
2 10 2) 2 12 100 14 | C 
3 11 3 3 13 1101 15 | D 
4 100 4 4 14 1110 | 16 | E 
5 101 5 5 15 1111 17 F 
6 110 6 6 16 10000 | 20 | 10 
7 111 7 7 17 10001 | 21 | 11 
8 1000 | 10 8 18 10010 | 22 | 12 
9 1001 11 9 19 10011 | 23 | 13 
10 1010 | 12 | A 20 10100 24 | 14 


Similarly, when converting a binary number such as 101010.100110 to hexadecimal, 
the groups are also formed about the binary point: 


(0010) (1010).(1001) (1000) = 2А.9816 


Table 3.1 shows the first twenty decimal, binary, octal and hexadecimal numbers. 


3.6 Adding and Subtracting Binary Numbers 


3.6.1 Adding Binary Numbers 


When adding binary numbers, which consist of Os and 1s, the possible arrange- 
ments are four, as shown in Table 3.2, which greatly simplifies the process. 
For example, to add 124 to 188 as two binary integers, we write: 


10111100; = 188 
01111100; = 124 
100111000, = 312 


Table 3.2 Addition of two binary integers showing the carry 


ЕН 
0/0 
1 


0 1 
1 
1 10 
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Such addition is easily undertaken by digital electronic circuits, and instead of having 
separate circuitry for subtraction, it is possible to perform subtraction using the 
technique of two’s complement. 


3.6.2 Subtracting Binary Numbers Using Two’s Complement 


Two’s complement is a technique for converting a binary number into a form such 
that when it is added to another binary number, it results in a subtraction. There are 
two stages to the conversion: inversion, followed by the addition of 12. To begin, we 
establish an agreed resolution, 10 bit. For example, 24 in binary is 00000110002, and 
is inverted by switching every 1 to 0, and vice versa: 11111001115. Next, we add 15: 
11111010005, which now represents — 24. If this is added to binary 36: 00001001002, 
we have: 


0000100100, = 36 
1111101000. = —24 
10000001100; = 12 


Note that the last high-order addition creates a carry of 15, which is ignored. Here 
is another example, 100 — 30: 


00000111105 = 30 
invert 1111100001, 
add 15 00000000015 


1111100010; = —30 
add 00011001005 = 100 
10001000110 = 70 


Ignore the high-order carry = 14, and the result is 10001105 = 70. 


3.7 Adding and Subtracting Decimal Numbers 
3.7.1 Adding Decimal Numbers 


When we are first taught the addition of integers containing several digits, we are 
advised to solve the problem digit by digit, working from right to left. For example, 
to add 254 to 561 we write: 
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Table 3.3 Addition of two decimal integers showing the carry 


+ 07 2 35 à S 6 8 9 
Шо 1 2 3 4 S 6 7 8 9 
МІ 2 3 4 5 6 7 8 9 0 
2 3 4 5 6 7 8 9 № 1] 
EM 3 4 5 6 7 8 9 0 11 15 
а 5: а т 55 5 ШІ 2 Қ 
m5 6 7 з 9 0 ч 2 3 4 
omo 7 5 9 0 2 5 4 5 
Ж 7 8 9 0 11 2 5 !4 5 6 
НВ 9 0 1 2 3 М4 15 6 17 
о 19 1 2 5 4 5 16 7 5% 


561 
254 
815 


where 4+ 1 = 5, 5 + 6 = 1 with a carry = 1,2 + 5 + carry = 8. 
Table 3.3 shows all the arrangements for adding two decimal integers with the 


carry shown аз Уп. 


3.7.2 Subtracting Decimal Numbers Using Ten’s 
Complement 


The binary system is not the only system that permits a subtraction to be represented 
as an addition, any base can be used. So let’s show how it is possible to subtract two 
decimal numbers using ten’s complement. There are two stages to the conversion: 
inversion, followed by the addition of 1. For example, let’s compute 465-148, which 
results in 317. Using a resolution of 5 digits. First, we invert 00148 by subtracting 
every digit from 9: 99, 851. Next, we add 1: 99, 852, which now represents —148. 
If 99, 852 is added to 00465: we have '00317. Ignore the carry, and the answer is 
317. 

Here is another example, 2, 145 — 312, which results in 1, 833. The resolution is 
6 digits: 
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000, 312 

invert 999, 687 
add 1 999, 688 
add 002, 145 
101, 833 


Ignore the carry, and we have 1,833. 


3.8 Adding and Subtracting Octal Numbers 


3.8.1 Adding Octal Numbers 


To add two octal numbers together, we use Table 3.4. For example, let’s add 1558 to 
1318: 
1558 
1318 
306. 


3.8.2 Subtracting Octal Numbers Using Eight’s Complement 


As an exercise, we will show how two octal numbers can be subtracted using eight’s 
complement. As with previous subtractions, there are two stages to the conver- 
sion: inversion, followed by the addition of 1g. Again, the final carry is ignored. 
For example, let’s compute 155g - 1318, which results in 24g. The resolution is 6 
octal digits. First, we invert 000131: by subtracting every digit from 7g: 777646g. 


Table 3.4 Addition of two octal integers showing the carry 


mo i1 2 заве 7 
Шо 1 2 3 4 5 6 7 
IM 2 3 4 5 6 7 о 
puo 3 4 5 6 7 0 11 
из 4 5 6 7 № Ш 1 
a 4 5 —6 1 о 9 5 
ЕН5 6 7 д 13 о 58 М 
о 7 0 1 2 13 14 15 
o pw My qp 2 "gs М "S G 
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Next, we add 13: 7776473, which now represents —1318. If 0001553 is added to 
777647: we have 10000248. Ignore the carry, and the answer is 248. 

Here is another example, 2145; — 3123, which results in 16338. The resolution is 
6 octal digits: 


0003128 

invert 7774658 
ааа Із 1777466; 
add 002145; 
1001633; 


Ignore the carry, and we һауе 16338. 


3.9 Summary 


The above descriptions of the decimal position system, binary, octal and hexadecimal 
numbers should be sufficient to allow the reader to understand the following worked 
examples. Naturally, there are algorithms for converting numbers from one base to 
another, and the reader may wish to explore these. 


3.10 Worked Examples 


3.10.1 Convert a Decimal Number into Binary 


Convert 13.75 into binary. 
Solution 


19:75: о 15350 ee E 
=1x24+1x274+0x2!'4+1x2°4+1x2-!4+1x27? 
13.75 = 1101.115 


3.10.2 Convert a Decimal Number into Binary Using an 
Algorithm 


Convert 39 into binary. 
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Solution 


Using an algorithm, we divide 39 by 2, and then repeatedly divide the previous result 
by 2: 


= 19 remainder 1 
= 9 remainder 1 
— 4 remainder 1 
= 2 remainder 0 


= 1 remainder 0 


— 195) 
VIE NINNIAN vo ЧЕЧЕ 


— 0 remainder 1 


The remainders in reverse order provide the result, 1001115. 


3.10.3 Convert a Binary Number into Decimal 


Convert 11011.15 into decimal. 


Solution 
10000; = 16 
10002 = 8 
1002 2 
12 = 1 
0.15 = 0.5 


11011.15 = 27.5 


3.10.4 Convert a Binary Number тю Octal 


Convert 11011.15 into octal. 
Solution 


11011.15 = (011)(011).(100)> 
(011)(011).(100); = 33.4 
11011.15 = 3344 
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3.10.5 Convert an Octal Number into Binary 


Convert 45.6 into binary. 


Solution 


45.6; = (100)(101).(110)» 
(100)(101).(110); = 100101.115 
45.6; = 100101.11> 


3.10.6 Convert an Octal Number into Hexadecimal 


Convert 123.45; into hexadecimal. 


Solution 


123.458 (001)(010)(011).(100) (101) 
(001)(010) (011).(100) (101) (0101) (0011).(1001) (0100)5 
(0101) (0011).(1001) (0100)5 53.9446 


123.45g = 53.9416 


3.10.7 Convert a Hexadecimal Number into Octal 


Convert А1В.А2 6 into octal. 


Solution 


А1В.А216 2 (1010)(0001)(1011).(1010)(0010)2 
(1010)(0001)(1011).(1010)(0010)2 = (101)(000)(011)(011).(101)(000)(100)2 
(101)(000)(011)(011).(101)(000)(100)2 = 5033.5048 
А1В.А216 =  5033.504g 
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3.10.8 Convert a Decimal Number into Octal 


Convert 123.40625 into octal. 


Solution 


1000000 
100000 

10000; = 

10005 

105 

1, 

0.015 

0.0015 
0.000015 = 
123.40625 
1111011.011015 = 
(001) (111)(011).(011)(010)5 = 


32 
16 


1 

0.25 

0.125 

0.03125 

1111011.011015 

(001) (111) (011).(011) (010) 
173.328 


123.40625 = 


173.328 


3.10.9 Convert a Decimal Number into Octal Using ап 


Algorithm 


Convert 123 into octal. 


Solution 


Using an algorithm, we divide 123 by 8, and then repeatedly divide the previous 


result by 8: 


123 
8 


1 


= 15 remainder 3 


= ] remainder 7 


— = 0 remainder 1 


со 


The remainders in reverse order provide the result, 1738. 
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3.10.10 Convert a Decimal Number into Hexadecimal 


Convert 123.40625 into hexadecimal. 


Solution 
1000000; = 64 
1000005 = 32 
10000; = 16 
1000; = 8 
105 = 2 
12 = 1 
0.015 = 0.25 
0.0015 = 0.125 
0.000015 = 0.03125 
123.40625 =  1111011.011015 


1111011.011015 =  (0111)(1011).(0110)(1000)5 
(0111)(1011).(0110)(1000)2 = 7В.6816 
123.40625 = 7B.6816 


3.10.11 Ааа Binary Numbers 
Add 1101101; апа 10110012. 
Solution 


11011015 = 109 
10110015 89 
11000110. = 198 


3.10.12 Subtract Binary Numbers 


Subtract 10110015 from 11011015 using two’s complement. 


50 
Solution 
1011001, = 89 
inversion 0100110» 
add 1, 00000015 
01001115 = —89 
ааа 109 11011015 = 109 
000101005 = 20 
3.10.13 Add Octal Numbers 
Add 155; and 1318. 
Solution 
155; = 109 
1318 = 89 
306; = 198 
3.10.14 Subtract Octal Numbers 
Subtract 23g from 1027s. 
Solution 
00023; 
invert 000233 777548 
add lg 777553 
ааа 010278 010273 
010048 


Ignore the final carry, we have 10045. 
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3.10.15 Add Hexadecimal Numbers 


Add 6р апа 5916. 


Solution 


60 = 109 
5916 = 89 
C6 = 198 


Chapter 4 A) 
Algebra cheret 


4.1 Introduction 


This chapter revises the elements of algebra such as notation, rules, indices, loga- 
rithms, explicit and implicit functions, intervals, function domains and ranges, odd 
and even functions and power series. The chapter concludes with some worked 
examples. 


4.2 Background 


Some people, including me, find learning a foreign language a real challenge; one 
of the reasons being the inconsistent rules associated with its syntax. For example, 
why is a table feminine in French, ‘la table’, and a bed masculine, ‘le lit’? They both 
have four legs! The rules governing natural language are continuously being changed 
by each generation, whereas mathematics appears to be logical and consistent. The 
reason for this consistency is due to the rules associated with numbers and the way 
they are combined together and manipulated at an abstract level. Such rules, or 
axioms, generally make our life easy, however, as we saw with the invention of 
negative numbers, extra rules have to be introduced, such as ‘two negatives make 
a positive’, which is easily remembered. However, as we explore mathematics, we 
discover all sorts of inconsistencies, such as there is no real value associated with 
the square-root of a negative number. It’s forbidden to divide a number by zero. Zero 
divided by zero gives inconsistent results. Nevertheless, such conditions are easy 
to recognise and avoided. At least in mathematics, we don’t have to worry about 
masculine and feminine numbers! 

As a student, I discovered Principia Mathematica (Russell and Whitehead 1903), 
a three-volume work written by the British philosopher, logician, mathematician and 
historian Bertrand Russell (1872-1970), and the British mathematician and philoso- 
pher Alfred North Whitehead (1861-1947), in which the authors attempted to deduce 
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all of mathematics using the axiomatic system developed by the Italian mathemati- 
cian Giuseppe Peano (1858-1932). The first volume established type theory, the 
second was devoted to numbers, and the third to higher mathematics. The authors 
did intend a fourth volume on geometry, but it was too much effort to complete. It 
made extremely intense reading. In fact, I never managed to get pass the first page! 
It took the authors almost 100 pages of deep logical analysis in the second volume 
to prove that 1 + 1 = 2! 
Russell wrote in his Principles of Mathematics (Russell 1938): 


The fact that all Mathematics is Symbolic Logic is one of the greatest discoveries of our 
age; and when this fact has been established, the remainder of the principles of mathematics 
consists in the analysis of Symbolic Logic itself. 


Unfortunately, this dream cannot be realised, for in 1931, the Austrian-born, and 
later American logician and mathematician Kurt Gódel (1906-1978), showed that 
even though mathematics is based upon a formal set of axioms, there will always be 
statements involving natural numbers that cannot be proved or disproved. Further- 
more, a consistent axiomatic system cannot demonstrate its own consistency. These 
theorems are known as Gódel's Incompleteness theorems. 

Even though we start off with some simple axioms, it does not mean that everything 
discovered in mathematics is provable, which does not mean that we cannot continue 
our every-day studies using algebra to solve problems. So let's examine the basic 
rules of algebra and prepare ourselves for the following chapters. 


4.3 Notation 


Modern algebraic notation has evolved over thousands of years where different 
civilisations developed ways of annotating mathematical and logical problems. The 
word algebra comes from the Arabic al-jabr w'al-mugabal meaning ‘restoration 
and reduction.' In retrospect, it does seem strange that centuries passed before the 
‘equals’ sign (=) was invented, and concepts such аз ‘zero’ (CE 876) were introduced, 
especially as they now seem so important. But we are not at the end of this evolution, 
because new forms of annotation and manipulation will continue to emerge as new 
mathematical objects are invented. 

One fundamental concept of algebra is the idea of giving a name to an unknown 
quantity. For example, m is often used to represent the slope of a 2D line, and c 
is the line's y-coordinate where it intersects the y-axis. The French mathematician 
René Descartes (1596-1650), formalised the idea of using letters from the beginning 
of the alphabet (а, b, c, ...) to represent arbitrary quantities, and letters at the end 


of the alphabet (р, q, r, 5, t,..., x, y, z) to represent quantities such as pressure 
(p), time (f) and coordinates (x, y, z). 
With the aid of the basic arithmetic operators: +, —, x, / we can develop expres- 


sions that describe the behaviour of a physical process or a logical computation. For 
example, the expression ax + by — d equals zero for a straight line. The variables x 
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and y are the coordinates of any point on the line and the values of a, b and d deter- 
mine the position and orientation of the line. The = sign permits the line equation to 
be expressed as a self-evident statement: 


O=ax+by—d 


Such a statement implies that the expressions on the left- and right-hand sides of 
the = sign are ‘equal’ or ‘balanced’, and in order to maintain equality or balance, 
whatever is done to one side, must also be done to the other. For example, adding d 
to both sides, the straight-line equation becomes: 


d = ax + by 


Similarly, we could double or treble both expressions, divide them by 4, or add 6, 
without disturbing the underlying relationship. When we are first taught algebra, we 
are often given the task of rearranging a statement to make different variables the 
subject. For example, (4.1) can be rearranged such that x is the subject: 


x+4 
Ja (4.1) 


2 


1 
y(2-2)=x44 
2 


Subtract 4 from both sides, and swap sides: 


1 
x=y е —4 


Making z the subject requires more effort: 


Multiply both sides by (2 — 


Nl 
Мм 


Multiply both sides by (2 — 1): 


Expand the left-hand side: 
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Subtract 4 from both sides, and rearrange: 
2у-х-4- z 
2 


Multiply both sides by z/(2y — x — 4): 


y 


E ER 


Parentheses are used to isolate part of an expression in order to select a sub- 
expression that is manipulated in a particular way. For example, the parentheses 
in c(a + b) + d ensure that the variables a and b are added together before being 
multiplied by c, and finally added to d. 

Occasionally, we have to divide a pair of numbers such as 56/3 7: 18.6667, where 
the remainder 6 is repeated indefinitely. To remind the reader this is written 18.6. 


4.3.1 Solving the Roots of a Quadratic Equation 


Problem solving is greatly simplified if one has solved it before, and having a good 
memory is always an advantage. In mathematics, we keep coming across prob- 
lems that have been encountered before, apart from different numbers. For example, 
(a + b)(a — b) always equals a? — P?, therefore factorising the following is a trivial 
exercise: 


a? — 16 = (a + 4)(а — 4) 
x? — 49 = (x +7)(x — 7) 
eae (r+) (x= 4) 


A perfect square has the form: 
а? + 2ab + D? = (a + by 
Consequently, factorising the following is also a trivial exercise: 


а? + 4ab + 4b? = (a + 25)? 
x? + 14x +49 = (x +7)? 
x? — 20x + 100 = (x — 10)? 


Now let's solve the roots of the quadratic equation ax? + bx + c = 0, i.e. those 
values of x that make the equation equal zero. As the equation involves an x? term, 
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we will exploit any opportunity to factorise it. We begin with the quadratic where 
а #0: 
ax? +bx+c=0 


Subtract c from both sides to begin the process of creating a perfect square: 
2 = 
ax” + bx = -с 


Divide both sides by а to create an x? term: 


› В с 
х+-х=-- 
а а 


Add b?/4a? to both sides to create a perfect square on the left side: 


b? b? 
402 4a2 а 


2 b 
х + -х + 
а 


Factorise the left side: 


Make 4а? the common denominator for the right side: 


wie > _ b= дас 
XF a) “д? 


Take the square root of both sides: 


b +b? — 4ac 
Xx + = 
2a 2a 
Subtract b/2a from both sides: 
УР? — 4ас b 
X = 
2а 2а 
Rearrange the right side: 
—b + Vb? — Дас 
x= —— 
2a 


which provides the roots for any quadratic equation. 
The discriminant vb? — 4ac may be positive, negative or zero. A positive value 
reveals two real roots: 
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Fig. 4.1 Graph of 
у=х?+х—2 


—b + Jb? — Дас —b — Jb? — Дас (42) 
о = — . 
2a 


X] = 
2a 


A negative value reveals two complex roots: 


| b+ y|P? — 4ac| i | —b-— y|P? — 4ac| i (4.3) 


2a cm 2a 


xl 


Readers may have noticed the use of | in (4.3), which are used to indicate that the 
enclosed expression is always positive. And a zero value reveals a single root: 


—b 


= 
2а 


For example, Fig. 4.1 shows the graph of y = x? + х — 2, where we can see that 


у = 0 at two points: x = —2 and x = 1. In this equation: 
1 
=1 
с--2 


which when plugged into (4.2) confirms the graph: 


=1+ 41-48 


Xp = ——————— = 1 
2 
-1-4/1--8 
X2 = 2 = 2 


Figure 4.2 shows the graph of у = x? + x + 1, where at no point does у = 0. In 
this equation: 
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Fig. 4.2 Graph of LN 
2 
y=xrt+x4l 2 
-3 -2 - 0 1 2 3x 
-1 
-2 


= 1 
b=1 
c=] 


which when plugged into (4.2) confirms the graph by giving complex roots: 


=1+ 41-4 ІШЕСІЗ 
X] = = + 1 
2 2 2 
|o-1-41-4 1 УЗ. 
= 2 = XE 
Let's show that x; satisfies the original equation: 
O= x? +х +1 
| ы ECT 
== 1 1 
2 2 2 2 
_1 ww. 3 ЕСЕ 
s иот 
= 0 


x» also satisfies the same equation. 
Algebraic expressions also contain a wide variety of functions, such as: 


Мх = square root of x 
Yx = n-th root of x 
x" — x to the power n 

n! — factorial n 
sin x = sine of x 


COS X — cosine of x 
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tan x = tangent of x 
log x = logarithm of x 


шх = natural logarithm of x 
Trigonometric functions are factorised as follows: 


sin? x — cos? х = (sin x + cos x)(sin x — cos x) 


sin? x — tan? x = (sin x 4- tan x)(sin x — tan x) 


sin? х + Asinx cos x + 4cos? x = (sin x + 2 cos x)? 


sin? x — 6sinx cos x + 9cos? x = (sin x — 3cos x)? 


4.3.2 Difference of Two Squares 


A useful algebraic identity is the difference of two squares, which works as follows. 
Given two numbers a and b, then: 


(a + ba -b) 2d? cab—-ab—- № = а? – b? 


4.4 Indices 


Indices are used to imply repeated multiplication and create a variety of situations 
where laws are required to explain how the result is to be computed. 


4.4.1 Laws of Indices 


The laws of indices are expressed as follows: 


а" x а" = q^ 
m 

a Lg" 
a” 


and are verified using some simple examples: 
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2 х 22 = 25 – 32 


ЩЕ 

2 = 25 = 4 
(2)! = 26 = 64 

From the above laws, it is evident that: 

ад СА 1 

а? = EX 
ap 
ai = “а 
az = Хар 


4.5 Logarithms 


Two people are associated with the invention of logarithms: the Scottish theolo- 
gian and mathematician John Napier (1550-1617), and the Swiss clockmaker and 
mathematician Joost Bürgi (1552-1632). Both men were frustrated by the time they 
spent multiplying numbers together, and both realised that multiplication could be 
replaced by addition using logarithms. Logarithms exploit the addition and subtrac- 
tion of indices shown above, and are always associated with a base. For example, 
Ша" =n, Шеп log, n = x, where а is the base. Where по base is indicated, it is 
assumed to be 10. Two examples bring the idea to life: 


10? = 100 then log 100 =2 
10° = 1,000 then log1,000 = 3 


The first example is interpreted as “10 has to be raised to the power (index) 2 to 
equal 100.’ The second is interpreted as “10 has to be raised to the power 3 to equal 
1000.’ The log operation finds the power of the base for a given number. Thus а 
multiplication is translated into an addition using logs. Figure 4.3 shows the graph 
of log x, up to x = 100, where we see that log 20 ~ 1.3 and log 50 ~ 1.7. Therefore, 
given suitable software, logarithm tables, or a calculator with a log function, we can 
compute the product 20 x 50 as follows: 


20 х 50 = log 20 + log 50 © 1.3 + 1.7 = 3 
10° = 1, 000 
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Fig. 4.3 Graph of log x 
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Fig. 4.4 Graph of In x 


a 
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In general, the two bases used in calculators and software are 10 ande = 2.71828.... 
To distinguish one type of logarithm from the other, a logarithm to the base 10 is 
written as ‘log’, and a natural logarithm to the base e is written ‘In’. 

Figure 4.4 shows the graph of In x, up to x = 100, where we see that In 20 = 3 
and In 50 ~ 3.9. Therefore, given suitable software, a set of natural logarithm tables 
or a calculator with a In function, we can compute the product 20 x 50 as follows: 


20 x 50 = ш20+ ш50 ~ 3 + 3.9 = 6.9 
еб ~ 1, 000 


From the above notation, it is evident that: 


log(ab) = loga + logb 


= (5) = loga — logb 


log(a") = n loga 
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4.6 Further Notation 


All sorts of symbols are used to stand in for natural language expressions-here are 
some examples: 


< less than 

> greater than 

< less than or equal to 

> greater than or equal to 
7: approximately equal to 
= equivalent to 

not equal to 


|x| absolute value of x 


For example, 0 < т < 1 is interpreted as: ¢ is greater than or equal to 0, and is less 
than or equal to 1. Basically, this means / varies between 0 and 1. 


4.7 Functions 


The theory of functions is a large subject, and at this point in the book, I will only 
touch upon some introductory ideas that will help you understand the following 
chapters. 

The German mathematician Gottfried von Leibniz (1646-1716), is credited with 
an early definition of a function, based upon the slope of a graph. However, it was 
the Swiss mathematician Leonhard Euler (pronounced Oiler) (1707-1783), who 
provided a definition along the lines: ‘A function is a variable quantity, whose value 
depends upon one or more independent variables.’ Other mathematicians have intro- 
duced more rigorous definitions, which are examined later on in the chapter on 
calculus. 


4.7.11 Explicit and Implicit Equations 


The equation: 
у = 3х? +2х +4 


associates the value of у with different values of x. The directness of the equation: 
‘y =’, is why it is called an explicit equation, and their explicit nature is extremely 
useful. However, simply by rearranging the terms, creates an implicit equation: 
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4 = y— 3х2 — 2х 
which implies that certain values of x and y combine to produce the result 4. Another 
implicit form is: 
0= у – 3х2 —2х—4 

which means ће same thing, but expresses the relationship in a slightly different 
way. 

An implicit equation can be turned into an explicit equation using algebra. For 
example, the implicit equation: 


4х + 2у = 12 


has the explicit form: 
y=6-2x 


where it is clear what y equals. 


4.7.2 Function Notation 


The explicit equation: 
у = 3х2 +2х +4 


tells us that the value of у depends on the value of x, and not the other way around. 
For example, when x = 1, y = 9; and when x = 2, y = 20. As y depends upon the 
value of x, it is called the dependent variable; and as x is independent of y, it is 
called the independent variable. 

We can also say that y is a function of x, which can be written as: 


y= f(x) 
where the letter ‘f’ is the name of the function, and the independent variable is 


enclosed in brackets. We could have also written y = g(x), y = h(x), etc. 
Eventually, we have to identify the nature of the function, which in this case is: 


f(x) = 3x? +2х +4 
Nothing prevents us from writing: 
у = f(x) = 3х2 +2x44 


which means: y equals the value of the function f (x), which is determined by the 
independent variable x using the expression 3x? + 2x + 4. 
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An equation may involve more than one independent variable, such as the volume 
of a cylinder: 
У =лг?һ 


where r is the radius, апа Л, the height, and is written: 


V(r, А) = xr?h 


4.7.3 Intervals 


An interval is a continuous range of numerical values associated with a variable, 
which can include or exclude the upper and lower values. For example, a variable 
such as x is often subject to inequalities like x > a and x < b, which can also be 
written as: 

a<x<b 


and implies that x is located in the closed interval [a, b], where the square brackets 
indicate that the interval includes a and b. For example: 


1<х < 20 
means that x is located in the closed interval [1, 20], which includes 1 and 20. 
When the boundaries of the interval are not included, then we would state x > a 
and x < b, which is written: 


a<x<b 


and means that x is located in the open interval |а, b[, where the reverse square 
brackets indicate that the interval excludes a and b. For example: 


1<х < 10 


means that x is located in the open interval |1, 10[, which excludes | and 10. 
Closed and open intervals may be combined as follows. If x > a and x < b then: 


a<x<b 
and means that x is located in the half-open interval [a, b[. For example: 


1<х<10 
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Fig. 4.5 Closed, open and 
half-open intervals. The 
filled circles indicate that a 


а, half-open interval 


or b are included in the a,b] half-open interval 
interval 
a, b| open interval 
a,b] closed interval 


means that x is located in the half-open interval [1, 10[, which includes 1, but not 
10. 
Similarly: 
1<х<Ь 


means that x is located in the half-open interval ]1, 10], which includes 10, but not 1. 
An alternative notation employs parentheses instead of reversed brackets: 


Ja, Ы = (a, Б) 
(а, b[ = [a, b) 
Ja, b] = (a, b] 


Figure 4.5 shows open, closed and half-open intervals diagrammatically. 


4.7.4 Function Domains and Ranges 


The following descriptions of domains and ranges only apply to functions with one 
independent variable: f(x). 
Returning to the above function: 


у= f(x) = 3х2 -2x +4 
The independent variable x, can take on any value from —oo to +00, which is called 
the domain of the function. In this case, the domain of f (x) is the set of real numbers 
IR. The notation used for intervals, is also used for domains, which in this case is: 


1- oo, -Foo[ 


and is open, as there are no precise values for —oo and +00. 
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As the independent variable takes on different values from its domain, so the 
dependent variable, y or f(x), takes on different values from its range. Therefore, 
the range of у = f(x) = 3x? + 2x + 4 is also the set of real numbers В. 

The domain of log x is: 

10, +оо[ 


which is open, because x = 0. Whereas, the range of log x is: 
] = оо; +оо[ 


The domain of Их is: 
[0, +оо[ 


which is half-open, because 4/0 = 0, апа +00 has no precise value. Similarly, the 
range of ./x is: 
(0, +оо[ 


Sometimes, a function is sensitive to one specific number. For example, in the func- 
tion: 
1 
y= }(х) = — 


х-і 


when x = 1, there is а divide by zero, which is meaningless. Consequently, the 
domain of f (x) is the set of real numbers В, apart from 1. 


4.7.5 Odd апа Even Functions 


An odd function satisfies the condition: 


Дх) = —f@) 


where x is located in a valid domain. Consequently, the graph of an odd function is 
symmetrical relative to the x-axis, relative to the origin. For example, sina is odd 
because: 

sin(—a@) = — sin œ 


as illustrated in Fig. 4.6. Other odd functions include: 


f(x) = ах 
Р(х) = ax? 
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Fig. 4.6 The sine function is sin a å 
an odd function 


> 
a 
Fig. 4.7 Тһе cosine function cos ФА 
is an even function 
= 
a 


An even function satisfies the condition: 


Р(х) = Ло) 


where x is located in a valid domain. Consequently, the graph of an even function is 
symmetrical relative to the f (x) axis. For example, cos o is even because: 


cos(—a@) = cosa 
as illustrated in Fig. 4.7. Other even functions include: 


/(х) = ах? 
/(х) = ах* 


4.7.6 Power Functions 


Functions of the form f(x) = x” are called power functions of degree n and are 
either odd or even. ІЕп is an odd natural number, then the power function is odd, else 
if п is an even natural number, then the power function is even. 
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4.8 Series 


Many constants and functions can be represented as a series of terms, which are 
summed or represented as a product. This is very convenient in computing, as it 
means that accuracy is maximised. The Greek symbol sigma У is used to stand for 
‘sum’, whereas the Greek symbol pi П is used for ‘product’. 

Here are a list of such series, which are given without proof. 


4 16 36 64 o 412 
Ae т Selle 
oo 
ed ttit 27 
емен Y 
ше, - mL ғ, “Lan 2п+1 
свх = 1-4 aoe Y" 
drm Tm. - У 


4.9 Binomial Theorem 


In mathematics, the term ‘binomial’ means the sum or difference of two terms. When 
the binomial (a + Б) is raised to an integral power, we can invoke the binomial 
theorem. For example, the first six terms are: 


(a+b) = 1 

(a + b)! 2 la + 1b 

(a + b = 1a? + 2ab + 1b” 

(a + Б)? = la? + 3a7b + За? + 1p? 

(a + b)* = la^ + 4а3Ь + ба?Ь? + A4ab? + 1b* 

(a + Б)? = 1a? + 5afb + 10а35? + 10а2Ь3 + Sab^ + 1p? 
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Any term is expressed by the following: 


(a+b) = n а" + п ар + п a"?p? + 
NE 1 2 UU 


Equation (4.4) can be expressed as the sum: 
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n п—1 п п 
+ ab + b 
n—i n 


(4.4) 


(a b) = У’ 


( 


where (") stands for: 


n h 
( Jens 

, І 

і-0 


п) _ п! 
и =й 


and means ‘the number of combinations of и elements selected i at a time.’ For 


0) 
() 
Ө 
0) 


3x2xl1 
CBx2x1) 

2х2х1 
“Оху 

2х2х1 
TOO xZxl- 
2 03x2x1 _ 
^ 3x2x1 — 


which are the coefficients in the expansion (a 4- by. 

The pattern of coefficients is known as ‘Pascal’s triangle’ and part is shown in 
Table 4.1. Note, that any row of numbers, is closely related to the previous row. 
And even though it has been named after the French mathematician Blaise Pascal 
(1623—1662), the triangle of numbers was known in China many centuries previously. 


4.10 Summary 


The above description of algebra should be sufficient for the reader to understand the 
following chapters. However, one should remember that this is only the beginning 


of a very complex subject. 


4.11 Worked Examples 


Table 4.1 Pascal’s triangle 


n=0 1 

m= 1 1 

MEA 1 2 1 

ines 1 3 3 1 
n=4 1 4 6 4 1 
п=5 1 5 10 10 5) 
n=6 1 6 15 20 15 6 


4.11 Worked Examples 


4.11.1 Algebraic Manipulation 


Rearrange the following equations to make y the subject: 


ga ад | х+ 68 x + 68 
329. 23-2” 3 — siny 
Solution 

7 +4 

Se 

x+4 

3-у- 
ы: 
х +4 

=3— 
d 7 

Ae 

"33 

Solution 

68 

Ba 

34+5 

ЖАЛЫ ы. 

e" 23 

1 x + 68 3 
e 23 
х-1 


71 


72 


Solution 
3 = аш 
3 — sin y 
| x + 68 
3 — sin y = 
23 
ЕЗГЕ x + 68 
ды 23 
lex 
|. 23 


4.11.2 Solving a Quadratic Equation 


Solve the following quadratic equations, and test the answers: 


О= х2 +4х +1, 0-2х2--4х--2, 022x? +4х+4 


Solution 
O=x7+4x +1 
—b + A/D? — дас 
б (а -— 
2а 
_ —-4+/16—4 
Е 2 
1—4 +12 
= 2 
= —2+/3 
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4.11 Worked Examples 
Test with x = —2 + v3: 


x? + 4х +1 = (—2 + 73)" +4(—2+ 4/3) +1 


—4—44343—8- 44/341 
=0 


Test with x = —2 — 4/3: 


y + 4х + 1 e(-2ueq/3)^ +4(—2 — v3) + 1 
=444/34+3-8-4V/3+1 
=0 


Solution 


| 
N 
= 

N 
+ 
A 
я 
+ 
N 


0- 


Test with x = —1: 


2x? +4x4+2=2-4+42 
=0 


Solution 


0 = 2х2 + 4х +4 
—b + Jb? — Дас 


73 
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Test with x = —1 + i: 


2x? + 4x + 4 = 2(-14 i +4(-1+1+4 
=21-21-10-4+4+4 
= —4i + 4i 
=0 


Test with x = —1 — i: 


2x? + Ах +4 2 2(-1— i? +4(-1-i) +4 
—2(14-2i— 1) 4—4i 4-4 
= 4i — 4 
=0 


4.11.3 Factorising 


Factorise the following equations: 


Asin? x — 4cos* x 


9 sin? x + 6sinx - cos x + cos? x 


25 sin? x + 10sinx - cos x + cos? x 


Solution 


Asin? x — 4cos? x — (2sin x + 2 cos x)(2sin x — 2 cos x) 
9 sin? x + 6sinx - cos x + cos? x = (3 sinx + cos x)? 


25 sin? х + 10 sin x - cos x + cos? x = (5 sin x + cos xy 


4.11.4 Binomial Theorem 


Show the expansion (2a + 3b)” for n = 0 to 3. 
Solution 


Qa + 3Ь)0 = 1 


(2а + ЗЬ)! = 2a + ЗЬ 


References 


Qa + 3b)? = 4a? 
(2a + 3b)? = 8a? 
References 
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2 x 2 x 3ab + 9b? = 4a? + 12ab + 9b? 
3 x 4 x За +3 x 2 x Sab? + 27Ь3 = 8a? + 36a2b + 54ab? + 27b? 


Russell B, Whitehead AN (1903) Principia mathematica. Cambridge University Press 
Russell B (1938) [First published 1903] Principles of mathematics, 2nd ed. WW Norton & Co 


Chapter 5 A) 
Logic сыны 


5.1 Introduction 


This chapter divides into three sections: truth tables, logical premises and set theory. 
The reader is introduced to truth tables using a tabular form of expressing logical 
outcomes. The section on logical premises introduces the idea of logical reasoning 
using constructs such as material equivalence, implication and reductio ad absurdum. 
The third section is an introduction to set theory including set building, empty sets, 
subsets, supersets and universal sets, etc. The chapter concludes with some worked 
examples. 


5.2 Background 


The English mathematician George Boole (1815-1864), is regarded as the ‘father’ 
of boolean algebra, which is why it bears his name. He did not associate logic 
with mathematics, but wanted to devise a logical framework for expressing and 
analysing logical statements. A logical statement contains one or more premises (or 
propositions), that form the basis of an argument. However, not all premises are 
true, and starting from an incorrect premise is not a good strategy for winning an 
argument, therefore, one must anticipate the existence of valid and invalid premises. 
Complex arguments often combine individual premises using the logical connectives 
negation (NOT), conjunction (AND), inclusive disjunction (OR) and the exclusive 
disjunction (XOR). Today, logic is considered to play a central role in mathematics, 
and Bertrand Russell believed that mathematics could be derived entirely from logic. 


© Springer Nature Switzerland AG 2024 77 
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5.3 Truth Tables 


In algebra, variables can assume an infinite range of numerical values, and equations 
often require careful analysis to discover their roots. However, in logic, variables 
possess two states: true (T) or false (F), or the binary states: 1 and 0, respectively. 
Consequently, when two or more logical variables are combined, the number of pos- 
sible combinations is finite, and often restricted to a dozen or so. These combinations 
are easily summarised in the form of a truth table, which automatically reveals the 
logical roots of the statement being investigated. For example, a single variable p 
can only assume two possible logic states Т or F, or using binary 1 or 0, as shown іп 
Table 5.1. With two variables p and q, the number of combinations increases to 
4 = 22, as shown in Table 5.2. With three variables p. q and r, the number of 
combinations increases to 8 — 2?, as shown in Table 5.3. 


5.3.1 Logical Connectives 


Over the past century logicians have developed a framework for describing and 
analysing logical statements using propositions and logical connectives. Unfortu- 
nately, the logical connectives developed by logicians are not found on a computer 
keyboard, and the computer science community has replaced them by other charac- 
ters. Table 5.4 shows the correspondence between the logical connectives used by 


Table 5.1 Truth table for one logic variable 


р 


Т 
Е 


Table 5.2 Truth table for two logic variables 


т [3 |9 E 
ч [ы |ы E 
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Table 5.3 Truth table for three logic variables 


раг 


4L 
T 


29| 
ні 


159) 
"i 


Table 5.4 Correspondence of logical connectives 
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Operation Meaning Logic Computer Science | Priority 
Negation NOT = lorc 1 
Conjunction AND A & 2 
Inclusive disjunction OR V — 3 
Exclusive disjunction XOR 5% X 3 
Implication IMPLIES > > 4 
Equivalence EQUIVALENCE © > 4 


logicians and computer scientists. It also shows the priority of each connective to 


control the sequence of evaluation. 


5.4 Logical Premises 


5.4.1 Material Equivalence 


The premises “Descartes is human’ and ‘Descartes is mortal’ are both true, but they 
are not materially equivalent. For example, you don’t have to be human in order to be 
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Table 5.5 Equivalence: p <> 4 


Pigi p e aq 
ШИ T 
TIE Е 
ЕШ Е 
ЕЕ d 


mortal, but you do have to be mortal in order to be human. Therefore, being mortal 
is a necessary condition for being human, but is not sufficient, because Descartes 
could be the pet name for a donkey, which although mortal, is not human. 

Now consider the premises ‘Descartes is human’ and ‘Descartes is a man’, where 
once again they are true, but not equivalent. For example, you don’t have to be a man 
to be human, but you do have to be human to be a man. Therefore, being a man is a 
sufficient condition for being human, but is not necessary, because Descartes could 
refer to Madame Descartes, who is not a man, but still human. 

Specifically, if p and q are premises such that p implies q, then p is a sufficient 
condition for q, and q is a necessary condition for p. If p is both a necessary and 
sufficient condition for q to be true, then p and q are logically equivalent. Such a 
condition is usually expressed using iff. For example, ‘Descartes is married iff he has 
a wife.’ This is also expressed as ‘Descartes is married iff he has a wife.’ Therefore 
p and q are equivalent if p is true and q is true, and p is false and q is false. This 
material equivalence is written p < q as shown in Table 5.5. 


5.4.2 Implication 


One observation in logical reasoning is that a false premise can lead to a false or true 
conclusion. Furthermore, it is false that a true premise leads to a false conclusion. 
However, a true premise must always lead to a true conclusion. Such a relationship 
is called implication. For example, if n є N then n? є №, and if n ¢ N then n? ¢ М. 
However, starting with a false value of п, such as и = 2 + 21 then it is false that 
n? Е N. But starting with и = —i, which is also false, leads to n? = 1 which satisfies 
n? € N. Therefore, given two premises p and q, where p implies q, this is written 
р — 9, as shown in Table 5.6. 

Let's examine the action of the other connectives using truth tables. In some cases, 
different combinations of T and F produce a true or false result, whilst others always 
produce a true result. This latter condition is called a tautology. 
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Table 5.6 Implicationp — 4 


HE p> aq 
ШЕ T 
TIE F 
ЕШ т 
ЕЕ т 


Table 5.7 Negation: ^p 


р|-р 
ШЕ 
EDT 


5.4.3 Negation 

Negation is the act of reversing a logical state. For example, if p is false, then —р is 
true, and vice versa, as shown in Table 5.7. 

5.4.4 Conjunction 


Conjunction is the linking together of two or more premises using the wedge ‘A’ 
connective. Table 5.8 shows the action of conjunction with two premises p and q, 
where p ^ q is true only when both p and q are true, otherwise it is false. Table 5.9 
shows the result with three premises. 


5.4.5 Inclusive Disjunction 


Table 5.10 shows the action of inclusive disjunction, where p V q is true when either 
p or q, or both are true, otherwise it is false. 
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Table 5.8 Conjunction: p A q 


р а рла 
ШЕ T 
NUS Е 
ЕШ Е 
в Е 


Table 5.9 Conjunction: pA qAr 


рлалг 


= 
2 
ң 


T T 


3 [3 


F E 


т 
= 
ег 


=] 
ні 
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"ri 
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Table 5.10 Inclusive disjunction: p У q 


piq pyg 
ШЕ T 
Ши T 
E T 
КІШ Е 
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Table 5.11 Exclusive disjunction: p Фа 


р а Peq 
BUS Е 
ШЕ) T 
ЕШ T 
RES F 


5.4.6 Exclusive Disjunction 


Table 5.11 shows the action of exclusive disjunction, where p Ө q is true when 
either p or q is true, but not both, otherwise it is false. 


5.4.7 Idempotence 


Tables 5.12 and 5.13 contain the unusual term: idempotence, which was introduced 
by the American mathematician Benjamin Peirce (1809-1880), to clarify certain 
mathematical or logical operations, and literally means ‘having the same power’ 
or is ‘not affected by.’ For example, in algebra, 1 x 1 = 1, where the number | is 
idempotent (not affected by) multiplication by itself. In logic, p v р = p, where У 
is idempotent when associated with two equal premises, and gives the same result 
p. Table 5.12 shows the idempotence of v, and Table 5.13 shows the idempotence 
of ^. 


5.4.8 Commutativity 


In arithmetic, the order of elements does not affect the numerical result. For exam- 
ple, 4 4- 6 =6- 4 and xy = yx. This is called commutativity. In logic, something 


Table 5.12 Idempotence of v: p <> pvp 


PPVp|P <> PVP 


ГІТ ШІ 


ЕСЕ T 
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Table 5.13 Idempotence of A:p <> pAp 


р |рлр]|р <> PAP 


Ии ү i 


ЕЕ т 


Table 5.14 Commutativity of v: руд <> qvp 


PqPpvqiqvpipvq«qvp 
ЖҮЛ Т T 1 
ШЕСІ T T 
ЕТТ T 1 
[ed de СЕ B ИВ 


Table 5.15 Commutativity of A: pA q <> q^p 


P/G|PAQG|GAP/PAG © q^p 
ШИН ob T E 
TEIE F T 
PEE Ẹ T 
СЕ Е Т 


similar exists when two premises are linked together with У or A. For example, 
p V q is identical to q v p, and p ^ q is identical to q ^ p. Table 5.14 shows the 
commutativity of v, and Table 5.15 shows the commutativity of ^. 


5.4.9 Аѕѕосіайуйу 


Generally in algebra, the grouping of elements does not affect the numerical result. 
For example, 2 + (3 + 4) = (2 + 3) + 4 and x(yz) = (xy)z. This is called associa- 
tivity. In logic, something similar exists when two or more premises are linked 
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together with v or ^. For example, (p V q) v r is identical to p v (q v r), and 
(р A q) Ar is identical to p ^ (q л г). Table 5.16 shows the associativity of v, and 
Table 5.17 shows the associativity of ^. 


Table 5.16 Associativity of v: (pV q) vr <> pv(qvr) 


ра (ру 9) Уг |р“ (9 Ут) | (ру) “г e ру (9 Уг) 
ТТ T m T 
TIT T T T 
TIF T T T 
T| F T T T 
ЕТ T T T 
FÍT T T T 
ЕЕ т T T 
ЕЕ Е Е Т 


Table 5.17 Associativity of A: (рл 9) Ar <> рл (qAr) 


pja (рлдллгірл(алг) | рлфлг <> рл (ал, 
тіт T E T 
TIT F F T 
T| F F F T 
T| F F F T 
FÍT F F T 
F|T F F Т 
ЕЕ Е Е T 
ЕЕ Е Е T 
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5.4.10  Distributivity 


5 Logic 


The distributive law of algebra permits us to expand x (y + z) into xy + xz. Similarly, 
in logic, the laws of distributivity permit us to writep A (q v r) <> (p^q)v(p^r) 


апару (q^r) © (ру 9) л (pv г), as shown in Tables 5.18 and 5.19. 


Table 5.18 Distributivity of ^ over v: pA (qyr) <> (рл 9) У par) 


р |а гірл(ауг) (рл ФУ(рлг)|рл(ауг) © (рл) У par) 
DIET T T T 
ЕЕ т T T 
FIET T T T 
ТЕЕ Е Е т 
РТТ Е Е T 
ЕТЕ Е Е T 
ЕЕТ Е Е T 
F| FIF F F T 


Table 5.19 Distributivity of v over A: pv (9 лг) <> (pV q)A (pvr) 


р |а гру (алг) | (pvq)^(pvr)|pv(qAr) © (руа)л(руг) 
TITT T T T 
ТІТЕ T T T 
ТЕТ T T Т 
ТЕЕ т T T 
FITT F F T 
ЕТЕ Е Е Т 
ЕЕТ Е Е T 
ЕЕЕ Е Е Т 
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Table 5.20 de Morgan’s Law: ^(p V q) <> —=pA-7q 


р|4(-ФУФ|-рл-а|-(руШф<>-рл-а 
DT F F T 
ТЕ Е Е T 
F/T F F T 
ЕЕ T T T 


Table 5.21 de Morgan’s Law: —(р ^ q) > ^p v ^q 


р|4/-(рАФ|-рУу-а|-(рХ4) © -ру-а 
тт Е Е T 
ТІЕ T T T 
F/T T Т T 
ЕЕ T T T 


5.4.11 de Morgan’s Laws 


The British mathematician and logician Augustus de Morgan (1806-1871), for- 
mulated what are now known as de Morgan’s Laws, as shown in Tables 5.20 
and 5.21. 


5.4.12 Simplification 


Some statements are so obvious it seems unnecessary to consider them. Nevertheless, 
their existence should be recognised, as they can help simplify complex logical 
statements. For example, p V T must always be true, irrespective of p. Similarly, 
p AF must always be false. Table 5.22 shows the equivalence p v T < T, and 
Table 5.23 shows the equivalence p A F <> F. Another form of simplification arises 
with p v F and p ^ T, which both equal p, as shown in Tables 5.24 and 5.25. 
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Table 5.22 Simplification: p v T < T 


р|рУТ|руТеТ 


Dee T 


ESSE T 


Table 5.23 Simplification: pA F < F 


р | рлЕ | pAF< Е 


ІШ Е T 


ЕЕ T 


Table 5.24 Simplification: p VF < p 


р | рУЕ | рУЕ <> p 


МҚШ) ШТ! 


BCF Т 


Table 5.25 Simplification: pA T < p 


р |рАТ P^T) e p 


шт T 


ЕЕЕ Т 


5.4.13 Excluded Middle 


A condition known as the excluded middle arises with the choice p v =p, which, 
after a little thought, must always be true. For when p is true, its negation is false, 
and vice versa, which guarantees either scenario being true, as shown in Table 5.26. 
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Table 5.26 Excluded middle: (p v ^p) Т 


P | “P | P V “P 
DE T 
TS К T 


Table 5.27 Contradiction: (p^ =p) > F 


р| р] рл р 
TICE F 
EST F 


5.4.14 Contradiction 


A condition known as contradiction arises with the combination p ^ —p, which after 
a little more thought, must always be false. For p and —p can never be equivalent, 
making a conjunction impossible, as shown in Table 5.27. 


5.4.15 Double Negation 

Knowing that two negatives make a positive, it will come as no surprise that 
-(-р) <> р, as shown in Table 5.28. This is known as double negation. 

5.4.16 Implication and Equivalence 


The truth table for implication has already been covered in Table 5.6, how- 
ever, implication is also expressed by р > q <> -РУ q, as shown in Table 5.29. 


Table 5.28 Double negation: -(-р) <> р 


р | —(—р) | -(-р) ер 


T т T 


F Е Т 


90 5 Logic 


Table 5.29 Implication: р > q <> -руя 


ВЕЕР р >94 <> рма 
TIT, T di М 
TIE Ẹ F aj 
БІТ Т Hi Ш 
ЕЕ T T Ak 


Table 5.30 Equivalence: (р <> q) > (P > Ф) л (q> p) 


рареар-ча(|4а-рА-(р>9/(а-р) |(р= 94) © А 
ТТТ T ШІ т T 
ТІЕ| Е F T F Т 
ЕТ Е T F F T 
ЕЕ T T T T T 


Similarly, the truth table for equivalence (Table 5.5) is also expressed by (p <> q) 
<> (р > 9) л (q > р), ав shown in Table 5.30. 

5.4.17 Ехрогіайоп 

Exportation covers the equivalence (p ^ q) > г <> p — (q -> г), as shown in 
Table 5.31. 

5.4.18 Contrapositive 


The law of contrapositive, is also known as modus tollens, and acknowledges the 
negative form of p —> q, i.e. ^q — —р, as shown in Table 5.32. 
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Table 5.31 Exportation: (p^ 9) > г < p> (q— г) 


r Az(p^q)—^riq—5r|Bzp (q—r)|A-e В 


= 
2 
= 

> 
2 


tom о чоң оч оя 
" "d d 7 7 оч d 
" d "d "d m ы 
ч ч о оол о ones; =ч 
dd dd іі тоя 
а 
d dd dd ч шоя 
d dd dd оя эя я 


Table 5.32 Contrapositive: p > q <> ^q —> —p 


Pee ГЕ DP-d:--'"d—'"P 
ЖТ КЕГЕ T T T 
ИІ БИГЕ Т E F 1 
ЕТТЕ T Т ТЕ 
ГЕ 17 Т T A 


5.4.19 Reductio Ad Absurdum 


Reductio Ad Absurdum is Latin for ‘reduction to absurdity’ and describes a form 
of argument that proposes a statement is true, by claiming that its opposite leads to 
an impossible or absurd result. For example, *men must have knees, otherwise they 
wouldn't be able to kneel down and propose!' There is a wonderful story concerning 
Bertrand Russell, who, during a lecture on logic, mentioned that in the sense of 
material implication, a false proposition implies any proposition. A bright student 
raised his hand and said ‘In that case, given that 1 = 0, prove that you are the Pope.’ 
Russell immediately replied, ‘Add 1 to both sides of the equation: then we have 
2 = 1. The set containing just me and the Pope has 2 members. But 2 = 1, so it has 
only 1 member; therefore, I am the Pope.’ 
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Table 5.33 Reductio ad absurdum: (p > q) ^ (p > ^q) > —p 


р > q|p— —q| (P> q)A (р> -9 | (P 9) A (р ~q) > -Pp 


= 
2 


шоп d 4 
" d 7 4 


Е Т 
Е a 
T T 
T 1. 


d в H 


Е 
F 
T 
T 


Table 5.34 Reductio ad absurdum: (^p — q) ^ (=p > ~q) > p 


БТ UID Ср а A (=p > —q) > p 


= 
X 


шош d dg 
7 d m"-" d 
шоп d d 


T ТІ 
T T 
E F 
T F 


d d оч 4 


We can express this fallacious form of argument by examining the conjunc- 
tion: (р — q) ^ (p — ~q), which posits that p implies q and ^q, which is absurd. 
Table 5.33 reveals that (p — q) ^ (p —> ^q) — —p, which is a useless result, and 
Table 5.34 shows its negative form. 


5.4.20 Modus Ponens 


Modus ponens is Latin for affirming mode and describes an argument containing 
three parts: a major premise, a minor premise, and a conclusion. For example, ‘If 
an integer is positive, then it is a natural number’, is the major proposition. ‘23 is 
a positive integer', is the minor proposition. From which, we conclude that 23 is a 
natural number. By letting p stand for a positive integer, and q stand for a natural 
number, the major proposition takes the form p — q. The minor proposition is simply 
p. which makes (p — q) ^ p. which in turn, implies q, as shown in Table 5.35. 


5.4 Logical Premises 


Table 5.35 Modus Ponens: (р > а) \p >q 


ра р>а (р> 4) ^р (р> 4 ^р 9 
TIT, T ЛЕ Т 
ТЕ F T Т 
БҮРІ Е Т 
ЕЕ T г Т 


5.4.21 Proof by Cases 
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Table 5.36 shows the principle known as proof by cases where if at least one of p or q 
is true, and each implies г, then г must be true as well: (ру 9) ^ (p > r) л (q > 


r) >r. 
Truth tables are extremely useful in the design of microelectronic logic gates, 


whose elements contain AND, OR, NOT, NAND, NOR, XOR and XNOR. They 
also play a role in clarifying logical outcomes in programming, but their principal 
weakness is their size, which is proportional to 2”, where n is the number of logical 


premises. The above truth tables are summarised in Table 5.37. 


Table 5.36 Proof by Cases: (ру 9) ^ (р > r) л (9 > г)] > г 


= 
2 


" ҥш поч ва а-а оя 
" "d Jd *" *" 4d d 
теа " d *" d " d 


r 


руа|р-г|а-г|А-(руа)л(р 
T T T T 
T F F F 
T T T T 
T F T F 
T T T T 
T T F F 
F T T F 
F T T F 


A 


> Y) л (а 


» r)] 


i d 
ч 


dd о чоч а 4-4 я 
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Table 5.37 Logical identities 


Identity Law 
= (Sp) р Double negation 
pvT<T Simplification 
pAF © Е Simplification 
pvF өр Simplification 
p^T < p Simplification 
ру-рет Excluded middle 
p^^p e Е Contradiction 
PYp<p Idempotence of v 
p^p — p Idempotence of ^ 
pvq-«eqvp Commutativity of V 
p^q <> qap Commutativity of ^ 


(pvq)vr <=> pv(qvr) 
Pagar <=> p^(q^r) 
худ Ер ag 
-(рл4) > -ру-а 
p^(qvr) © (рла)у(рлг) 
ру(алг) <> (руа) л (руг) 
р> 4 <> -рУу4 
р <>а <> (р Фл(а->р) 
(рл 4 > г © p> (qr) 


р==ц фетр 


(p> q)A (p> ^q) > p 


(posu 2-9 түр 
(р> 9 л9 > 9 


(рул (р> г) л (9 > г) > г 


Associativity of V 
Associativity of A 
de Morgan’s Law 
de Morgan’s Law 
Distributivity of A over V 
Distributivity of V over A 
Implication 
Equivalence 
Exportation 
Contrapositive 
Reductio ad absurdum 
Reductio ad absurdum 
Modus ponens 


Proof by cases 
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5.5 Set Theory 


We have already covered some of the ideas behind set theory, especially Cantor's 
work in classifying infinite sets. Bertrand Russell's paradox showed that the set of all 
sets could never be a set referenced by another set, which kept alive the search for an 
alternative system. In 1902, the German logician and mathematician Ernst Zermelo 
(1871-1953), published a paper on adding transfinite cardinals, and in 1908 pub- 
lished another paper on Zermelo Set Theory. This was developed by the Norwegian 
mathematician Thoralf Skolem (1887-1963), and the German-born Israeli mathe- 
matician Abraham Fraenkel (1891—1965), which resulted in the Zermelo-Fraenkel 
set theory, abrieviated to ZF. This system builds upon the empty set and Cantor's 
power set, but does not accept that all collections of sets constitute a set, which 
prevents recursive scenarios. 

In the late 19th-century, the English logician and philosopher John Venn (1834- 
1923), revised a graphical technique using circles to represent sets, whose relation- 
ships are reflected in the nesting or intersection of the circles. He referred to them 
as Euler diagrams, as Euler had previously used them. However, Venn popularised 
their usage, which is why now they bear his name Venn Diagrams. 

Developing the definition that a set is a collection of objects, let's examine empty 
sets, set building, combining sets, and Cantor's power set, using Venn diagrams. 


5.5.1 Empty Set 


An empty set is aset with no members. For example, if a farmer owns а set comprising 
a field of five donkeys (Betty, George, Albert, Descartes, Mary) and one night they 
are all stolen, the farmer now owns an empty set in the form of an empty field, 
represented by Ø, or {}. One is tempted to question whether the field really belongs 
to the set of donkeys, to which the answer is no, but it does provide a mental device 
for visualising the concept of emptiness. Although the concept of an empty set is 
fundamental to ZF, and can be manipulated constructively, it is an abstract idea and 
remains a topic for continued discussion. 


5.5.22 Membership апа Cardinality of a Set 


We have already discovered that the symbol е means ‘member of’ or “belongs to’ 
which permits us to write: 


if S={a, е, i, о, и) then a, е, і, о, иє 5 
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In this example, the set 5 contains 5 elements, written |S| = 5, and is called the 
cardinality of S. 
A set element may also be another set. For example: 


if S= {а, Б, c, (d, e) then a, b, c, (d, e) e S 


Therefore, the set's cardinality is |S] = 4. 


5.5.3 Subsets, Supersets and the Universal Set 


The set of natural numbers N can be divided into various subsets. For example, if 
E is the set of all even natural numbers, and O is the set of all odd natural numbers, 
then Е and О are subsets of №, written E C №, and О C N. (The symbol C is also 
used in place of C.) For any problem associated with sets, there is an associated 
universal set to which the sets belong. In this case, N is the universal set, as shown 
in the Venn diagram in Fig. 5.1. 

Conversely, if E C N and O C N, then N is a superset of E and O, written 
ND E and МР О. (The symbol 2 is also used in place of D.) 

The two sets E and O can also be divided into two subsets Po for odd primes, 
and Ре for even primes, as shown in Fig. 5.2. We could have also added the subset 
of primes P, as shown in Fig. 5.3. 


5.5.4 Set Construction 


Set constructIon using the notation {< variable > | < predicate >} or { «variable: 
< predicate >}, where «variable is an arbitrary name given to the set's elements, 
and «predicate is a logical filter associated with the variable. For example, S = 
{n|n € N An > 23}, reads ‘The elements of set 5 comprise n, such that n is a natural 
number and is greater than, or equal to 23.' i.e. 5 = (23, 24, 25, 26, 27,...}. 


Fig. 5.1 Odd and even 
numbers as subsets of N N 
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Fig. 5.2 Ро and Pg as 
subsets of O and E 


Fig.5.3 Р as a subset of 
both O and E 


Неге are some more examples, where the existential quantifier 3 is introduced, 
which stands for ‘there exists’. For example, Ят е Nn = 2m, reads ‘there exists ап 
m belonging to N where n = 2m.’ 


S={n|neNal<n<5} = (1, 2, 3, 4, 5} 


S = {п | Эт € Nn = 2m} = {2, 4, 6, 8, 10,...} 
S = {п | 3m Е Nn = 2m — 1} = {1, 3, 5, 7, 9,...} 
S-í(n|liÀmeNn-2m) = 11, 4, 9, 16, 25,...} 


5.5.5 Union 


The union of two sets A and B, is another set combining their respective elements 
without duplications, and is written A U B, which reads ‘A union B’, or ‘the union 
of A and В.’ 

Figure 5.4 shows the Venn diagram for two sets A and B, which are subsets of 


the universal set U, and their union is represented by the complete shaded area. For 
example, if: 
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Fig. 5.4 The union of A and 
B: AUB U 


A = {Euler, Newton, Russell, Cantor} 
B = {Gauss, Peano, Russell, Cantor} 
AU В = (Euler, Newton, Russell, Cantor, Gauss, Peano} 


Similarly: 


А = (1, 4, 6, 8, (23, 41} 
B = {{23, 41}, 3, 5, 8} 
AUB = {{23, 41}, 1, 3, 4, 5, 6, 8} 


5.5.6 Intersection 


The intersection of two sets A and B, is another set containing their common ele- 
ments, and is written А N B, which reads “А intersection В”, or ‘the intersection of 
A and В.” 

Figure 5.5 shows the Venn diagram for two sets A and B, which are subsets of the 
universal set U, and their intersection is represented by the shaded area. For example: 


A — (Euler, Newton, Russell, Cantor) 
B — (Gauss, Peano, Russell, Cantor] 
АПВ = (Russell, Cantor} 


5.5.7 Relative Complement 


The relative complement between two sets is the set of elements belonging to one, 
but not the other. For example, the relative complement of B in A is written АХ B, 
and represents all the elements belonging to A, and not B. For example: 
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Fig. 5.5 The intersection of 
Aand B: АПВ U 


Fig. 5.6 The relative 
complement of B in A: U 
А\ В 


A = {Euler, Newton, Russell, Cantor} 
B = {Gauss, Peano, Russell, Cantor} 
А \ В = {Euler, Newton} 
B\ A= {Gauss, Peano} 


Figure 5.6 shows the relationship А \ В, and Fig. 5.7 shows the relationship В \ A. 
As this can be a difficult relationship to grasp, let’s give a formal definition of A 
in B, and Bin A: 


В\А = {х |хєеВлх ¢ A} which reads ‘an element x is in В but not A.’ 
А\ В = {х|хєАлх є B) which reads ‘an element x is in A but not В.’ 


Examples are: 


(а, Б, с, d}\ №, с, d, e) = {a} 
{john, heidi, edwin, marie} \ {edwin, marie} = {john, heidi} 
{23, 5, 41, 27, 3, 29, 2} \ 123, 5, 19, 41, 44, 29, 2} = {27, 3} 
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Fig. 5.7 The relative 
complement of A in B: 
B\A 


Fig. 5.8 The absolute 
complement A^ = И \ А 


5.5.8 Absolute Complement 


The absolute complement (or complement) of a set A is the difference between 
the associated universal set U and А, and written A^ = U \ A, as illustrated in 
Fig. 5.8. For example, if the universal set is N, and A = (n|n € NA^n > 1} = {1}, 
then A^ = N \ A} = 1. It follows that U* = Ø and * = U. 


5.5.9 Power Set 


Given a set 5, Cantor’s power set is the set of all subsets of 5, which includes 5 
and the empty set й, and is written P(S). Therefore, if S = {a}, then P(S) = {0, a}. 
Table 5.38 shows the power sets for sets with 1, 2 and 3 elements, where it become 
clear that if a set has п elements, its power set contains 2” elements. Cantor used the 
power set and the idea of one-to-one correspondence to reveal an infinite hierarchy 
of infinities. 
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Table 5.38 Power sets for different sets 


5 P(S) S| 
{a} {0, а} 2 
{a,b} (0, a, b, (a, Б} 4 


{a,b,c} | (P, a, b, c, (a, b), (b, с}, {a, с}, (a, b, с} 8 


5.6 Worked Examples 


5.6.1 Truth Tables 


Design the truth tables for: 


(p ^ q) v ^q 
(p^ q) У ^p 
(p V q) ^ ^q 
(ру) л =p 


Solutions 
See Tables 5.39, 5.40, 5.41 and 5.42. 
5.6.2 Set Building 


State the sets or the positive real numbers, and the positive integers greater than 100. 


Table 5.39 Truth table for (p ^ q) v ^q 


р|4/р/4|-4 (p^q) V—^q 
ШЕ ЖТ Е Т 
КӨЕ áp T. 
ESSE Б F E 
ЭЕ Ар Т 
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Table 5.40 Truth table for (p ^ q) v ^p 


= 
2 
= 

> 
2 


ap 


(p Aq) У p 


= " d d 
то d m" d 
" " " d 


(oll aste ыз бе 


T 
E 
i 
T 


Table 5.41 Truth table for (p V q) ^ ^q 


Pq|PVq|—q (PV q)A-q 
DC т F E 
РТ T Т 
НЫС Е Е 
БІР Б T F 


Table 5.42 Truth table for (p V q) A ^p 


р|а|руа(|-р (PY q)A—p 
EP > Е Е 
ЖЕБЕ se F F 
ЕТ Т T T 
F| F F T F 
Solutions 
5 = {х|хЕВлх > 0} 


5 = (n|n e Z^n > 100} = (101, 102, 103, 104, ...} 
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5.6 Worked Examples 


5.6.3 Sets 
Given: 


А = {1, 2, 3, 4, 5} 
В - {2, 4, 6} 


find AU B, АПВ, A\ В, BV A. 


Solution 


AU B= {1, 2, 3, 4, 5, 6) 


АПВ = (2, 4} 
AX В = (1, 3, 5) 
В\А= {6} 


5.6.4 Power Set 
Specify the power set for S = (1, 2, 3, 4}. 
Solution 


P(S) = (0.1, 2, 3, 4, (1, 2, {1, 3}, (1, 4}, 12, 3}, {2, 4}, {3, 4}, 
(1.2, 3}, {1, 2, 4}, (1, 3, 4}, {2, 3, 4}, {1, 2, 3, 4] 
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Chapter 6 (Я) 
Combinatorics Check for 


6.1 Introduction 


This chapter contains two sections: permutations and combinations. Formulae are 
derived and illustrated with examples. The chapter concludes with a variety of worked 
examples. 


6.2 Permutations 


Given a set of a finite number of distinct elements, a permutation is an ordered 
arrangement of all or part of these elements, and is sensitive to their order. Braces 
are used for unordered sets, and parentheses for ordered sets. 

As it is possible to select a group of elements, rather than the entire set, the 
following notation is often used to represent the number of permutations: P(n, k), 
where n is the number of elements in the set, and k is the number of elements taken 
at a time. Let’s illustrate this with sets of two, three and four letters. 

The set of two letters {A, B} can be ordered in two ways taking all the elements: 


Р(2, 2)=2, (A,B), (В, А) 
The set of three letters {A, B, C} can be ordered in six ways taking all the elements: 


P(3, 3) = 6, (A,B,C), (А, С, B) (B, A, С) 
(B,C, A), (С, А,В), (С, В, A) 


The set of four letters {A, B, C, D} can be ordered in twenty-four ways taking all 
the elements, P(4, 4) = 24, as shown in Table 6.1. 

Using the last example, the first letter has four possibilities; the second let- 
ter has three possibilities; the third letter has two possibilities; and the last letter 
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Table 6.1 Twenty-four ways the letters (А, B, C, D} can be ordered 


6 Combinatorics 


(A, B, C, D) 
(B. A, C, D) 
(C. A, B, D) 
(D, A, B, C) 


(A, B, D, C) 
(B, A, D, C) 
(С, A, D, B) 
(D, A, C, B) 


(A, C, B, D) 
(B, C A, D) 
(C, B. A, D) 
(D, B, A, C) 


(A, C, D, B) 
(В, C, D, A) 
(C, B. D, A) 
(D, В, С, A) 


(A, D, B, C) 
(В, D, A, C) 
(C, D, A, B) 
(D, C, A, B) 


(A, D, C, B) 
(В, D, C, A) 
(C, D, B, A) 
(D, C, B, A) 


has one possibility. Therefore, the number of permutations is P(4, 4) 24x 3x2 
x1 = 24, and P (n, n) = n!. For example, six different coloured tulip bulbs can be 
planted in a row, іп P(6, 6) = 6х5 x 4 x 3 2 = 720 permutations. 

Now let’s take ordered pairs from sets of two, three and four letters. 

The set of two letters {A, B} can be ordered in two ways taking two elements at 
a time: 

Р(2, 2) -2, (A,B), (В, А) 

The set of three letters (А, В, С) can be ordered іп six ways taking two elements 

at a time: 


P(3, 2)=6, (A,B), (A, C) 
(В, А), (В,С) 
(С, A), (C. B) 


The set of four letters (A, B, C, D} can be ordered in twelve ways taking two 
elements at a time: 


P(4, 2) = 12, (А,В), (А, С), (A, D) 
(В, А), (В,С), (B, D) 
(С, А), (С.В), (C, D) 
(D, А), (D, В), (D. C) 
A formula that satisfies these three examples is: 
P(n, 2) =п(п = 1), PQ, 2)=2, PGB, 2) =6, Р(4, 2) = 12 


Let’s derive a formula for selecting any number of elements from a set of distinct 
elements. 

Given a set of n elements, we can construct all possible permutations by selecting 
the elements one at a time. The first element can be chosen іп и different ways. The 
second element in (п — 1) ways, the third in (п — 2) ways, etc. Thus, the number of 
all permutations that are possible with n elements is: 


P(n,n)2n(n—1)(n—2) x ...x3x2x1-n! 
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If we start with n elements and construct the permutations with k positions, the first 
element can be chosen in n different ways. There are n — | elements left for the next 
К — 1 positions. The second position has п — 1 elements, the third п — 2, and so on; 
for the k® position there are и — (k — 1) elements: 


P(n, К) = n(n—1)n—2) x ...x[n — (k — 1)] 
—n(n—1)n—2)x...x(n-—k-1) 
n(n—1)n—2) х... х (п = + Dn — k)! 
= (n — k)! 
n(n —Dn—2)x...x(n—k-D(n—E)x...x3x2x1 
(n — k)! 


n! 
(n — К)! 


wheren, Ке М, и> К. 
As а simple test, when п = 4 and К = 2: 


4! 
Ра, 2) = = 12 


which agrees with our original example. 
To compute the number of ordered sets, or permutations, taking two letters at a 
time from the unordered set (C, О, M, P, О, T, E, К} we have n = 8 and k = 2: 


8! 
P(8,2)— 8-2! 
8! 
~ 6! 
=8х7 


= 56 
Table 6.2 lists the 56 two-letter sets for (C, О, M, P, О, T, E, R}. 


Table 6.2 Two-letter permutations of the word COMPUTER 
(C, E) | (C, O) (C, M) (C, P) | (C, R) | (C, T) | (C, U) 
(O, C) | (O, E) (O, M) | (O, P) | (O, R) | (O, T) | (O, U) 
(М, С) (M, E) (М, О) | (M, P) (M, В) | (M, Т) M, U) 
(Р.С) | (BE) (Р.О) | (Р.М) | (Р.К) | (Р.Т) ФО 
(U, С) | (U, E) | (U, O) | (U, M) (U, P) | (U, R) | (U, T) 
(LO | (ТЕ (Т,О) | (Т,М) (LP) (Т.К) (LU) 
(E, С) | (Е, О) | (Œ, М) | (EP) (Е, В) | Œ, T) | Œ, U) 
(К, С) | (Е, Е) (6,0) (R, М) (К,Р) (R, T) (В, U) 
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6.3 Permutations of Multisets 


A set must contain a collection of unique elements, whereas a multiset contains 
repetitions of one or more of its elements. The number of repetitions is called the 
element’s multiplicity. For example, in the multiset {M, A, N, H, A, T, T, A, N}: 


the element M has a multiplicity of 1 
the element A has a multiplicity of 3 
the element N has a multiplicity of 2 
the element H has a multiplicity of 1 
the element T has a multiplicity of 2 


and can be written (1M, ЗА, 2N, ІН, 2T]. 

Given a multiset M with n elements, an r-permutation of M is an ordered arrange- 
ments of r elements of M. However, an n-permutation of M is called a permutation 
of M. 

In order to calculate the number of permutations of a multiset M containing n 
elements, we let n; be the multiplicity for each element, where i = 1, k. Therefore, 
п + пә + пз +... Бп = п. 

In order to calculate the number of permutations of М, we begin with n free places 
for n; elements: h ) 


ПІ 


? П = П 
There now remain n — n, free places for n» elements: ( ) 
n»? 


А п = № = п2 
There now remain n — n, — n» free places for из elements: ) 


ы п ПІ n» Ұл ПЕ-1 
And so on, until the К!” elements: ) 


ny 
The total number of permutations of M is given by: 


n П = П] п Пі n» ES Пр-1 
х х...х 
П] n» nk 


n! (n — ni)! П = Пу = По = Пз – ... = Nk- 


кал Ж 
п\!(п— ni)! п›!(п—пу—п»)! n!(n — ni — n — n3 — ...— ny) 
n! 
n!no!n3!...ng! 
For example, the multiset М, A, N, H, A, T, T, A, №} = {1M, ЗА, 2N, ІН, 2T). where 


n=9, пү=1, m=3, пз= 2, n4= 1, ns = 2. Therefore, the number of 
permutations of the word is given by: 
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n! 9! 
n1!n2!n3!n4lns! 113121102! 
9x8x7x6x5x4x3x2 
Е 3x2x2x2 
—9x8x7x6x5 
— 15,120 


Similarly, the word REDDER is the multiset (2R, 2E, 2D}, where n = 6, n; = 2, 
n» = 25 пз = 2: 
Therefore, the number of permutations of the word is given by: 


n! 6! 
ninja! 21212! 
6x5x4x3x2 
TU DESERTO 
—6x5x3 
= 90 


In order to visualise the permutations of a multiset, let's consider a shorter word. The 
word NOON is the multiset {2N, 20}, where n = 4, n; = 2, m = 2. 
Therefore, the number of permutations of the word is given by: 


n! 4! 

nun; 2121 
_4х3х2 
© 2x2 
=3x2 


The permutations аге NNOO, NOON, NONO, ONNO, ONON, ООММ. 


6.4 Combinations 


Unlike permutations, combinations disregard the order of elements created from a 
set. Thus (B, A) is the same as (A, B), and only one is selected. The disregard for 
order conflicts with the use of the word combination in the context of locks, where a 
combination lock depends upon the correct order of digits to open the lock. However, 
we are where we are, and we must pay special attention to the mathematical usage 
of the word. 

When selecting two elements from a set of three elements {A, B, C} creates six 
permutations: 
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P(3, 2) = 6, (A,B), (В, А), (А, С), (С,А), (В,С), (СВ) 


А А А А 3 
However, the number of combinations is reduced to three, and is expressed as (5): 


(5) = (А,В), (А,С), (В,С) 


In this context, the relationship between combinations and permutations is: 


В Ni S 
2] | 2 


Therefore, given a set of n distinct elements, and taking k elements at a time, 
there are d combinations. We now require a general formula for the number of 
combinations. 

Table 6.1 shows the twenty-four permutations of the set ( A, B, C, D) taking all the 
elements at a time. However, if we disregard those permutations where the elements 


are ordered differently, we end up with only four combinations: 


(А, В, С), (A, B, D) (A,C, D), (B, C, D) 


As each combination creates 3! — 6 ordered triples, we can generalise the following 


relationship: 
n n P(n, К) n! 
P(n, k) = k! or = = 
k k k! (n — k)!k! 


Thus (6.1) gives the number of combinations of a set of и distinct elements taken k 
at a time: 
n n! 
=, п, КЕМ, n-k (6.1) 
k (n — КК! 


Substituting some values for n and k into (6.1) we get: 


2 2! 2 2! 
= — = 2. ш--- = || 
(1) 1111 (5) 0!2! 
3 3! ! ! 
1) 2! 2) 112! 3) 013! 
4 4! 4 4! 4 4! 4 4! 
= = 4, =. =6, = =4. ------ | 
17.3! 2) 212! 3) 19 4] 014! 


which are identical to the binomial coefficients for x!, x?,..., x" in the expansion 
(1-- х)": 
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Table 6.3 Two-letter combinations of the word COMPUTER 


(С.Е) | (С,О) | (C. M) | (С.Р) | (C. R) | (C, T) | (C, U) 
(О, Е) | (О, M) | (OG, P) | (О, R) | (O, T) | (0,0) 
(M, B) | (М.Р) | (M, R) | M, T) | M, U) 
(РЕ) | (BR) | (РТ) | (BU) 
(U, E) | (U, R) | (U, T) 
(LE) | (LR) 
(E, R) 


(3x = 14 2% + 1x? 
(xy at + 3х? + de 
(+x = 1+ 4x + 6x? + 4х% E" 


Combinations are used in problems where we are interested in selecting groups of 
things, irrespective of order, from some collection. For example, let’s find the number 
of combinations in the set of letters {C, O, M, P, U, T, E, R} taking two letters at a 
time. As there are eight letters, п = 8, and К = 2. Therefore: 


! 
8 _ 8! -8x1 ag 
2 (8 — 2)!2! 2 


The combinations, which have to be derived manually, are shown in Table 6.3. 


6.5 Worked Examples 


6.5.1 Eight-Permutations of a Multiset 


Find the number of 8-permutations of the multiset: 
M = {A, A, B, B, B, B, C, C, C} = {2A, 4B, 3C} 


Solution 

We observe that M contains 9 elements, but have to limit our permutations to 8 
elements. The extra element can be removed either from the 2As, the 4Bs, or the 
3Cs, which means that we must sum the permutations contributed by each possibility. 


Removing it from the 2As, the number of 8-permutations of {A, 4B, 3C} ТЫ 


8! 
Removing it from the 4Bs, the number of 8-permutations of {2A, 3B, 3С} 213131' 
2 


Removing it from the 3Cs, the number of 8-permutations of {2A, 4В, 2С) сар 
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Computing the three contributions: 


8! _ 8х7хбх5х4х3х 2 


114131 — 4x3x2x3x2 
=8x7x5= 280 

8! 8x7x6x5x4x3x2 

213131 2х3х2х3х 2 


=8х7х5 х2 = 560 
8! 8x7x6x5x4x3x2 
214171 2х4х3х2х2 
=2x7x6x5= 420 


The total number of 8-permutations is 280 + 560 + 420 = 1, 260. 


6.5.2  Eight-Permutations of a Multiset 


Find the number of 8-permutations of the multiset: 
М = {A, A, A,B, В, B, C, С, С} = {3A, 3B, 3C] 


Solution 

We observe that М contains 9 elements, but have to limit our permutations to 8 
elements. The extra element can either be removed from the 3As, the 3Bs, or the 
3Cs, which means that we must sum the permutations contributed by each possibility. 


Removing it from the 3As, the number of 8-permutations of {2A, 3B, 3C} Г 
ы” 
Removing it from the 3Bs, the number of 8-permutations of {3A, 2B, 3С} 30131' 
с 
Removing it from the 3Cs, the number of 8-permutations of {3A, 3B, 2С} 3301" 
Each possibility gives the same result, therefore the total number of permutations is 
! 


3x ——: 
31312! 


8! 8x7x6x5x4x3x2 


X — =3 х 
21312! 3x2x3x2x2 
=3х8х7х5 х2 = 1, 680 


3 


The total number of 8-permutations is 1,680. 
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Table 6.4 The 24 permutations of М 
(2,3,4,5) | (2,3,5,4) | (2,4,3,5) | (2,4,5,3) | (2,5,3,4) | (2,5,4,3) | (3,2,4,5) | (3,2,5,4) 
(3,4,2,5) | (3,4,5,2) | (3,5,2,4) | (3,5,4,2) | (4,2,3,5) | (4,2,5,3) | (4,3,2,5) | (4,3,5,2) 
(4,5,2,3) | (4,5,3,2) | (5,2,3,4) | (5,2,4,3) | (5,3,2,4) | (5,3,4,2) | (5,4,2,3) | (5,4,3,2) 


6.5.3 Number of Permutations 


Given a set M = {2, 3, 4, 5}, calculate the number of permutations of M, and write 
down their values. 


Solution 
The number of permutations of M is 4!, and the numbers range between 2, 3, 4, 5 
and 5, 4, 3, 2. 

4! = 4x 3 х2 = 24 


Table 6.4 shows the permutations of M. 


6.5.4 Number of Five-Card Hands 


How many different 5-card hands can be dealt from a deck of 52 cards? 


Solution 
As the order of the cards is not asked for, we are dealing with a problem of combi- 


nations: 
п) _ п! 
К) (n — K)Ik! 


The number of combinations taking 5 cards from 52 is: 


2 2! 2 1 49x 4 11, 875, 2 
(5) 5 _ 52x51 x 50x 49х48 311,875 00 L 5 598, 960 


5) (62-55 | 5x4x3x2 B 120 


6.5.5 Hand Shakes with 100 People 


A group of 100 people shake hands with each other. How many handshakes take 
place? 
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Solution 
As it is not sensitive to order, we require the number of combinations: 


100 100! 100 x 99 
— — Белка 4 
( 2 ) (100 — 2)!2! 2 pt 


6.5.6 Permutations of MISSISSIPPI 


Find the number of permutations of the word MISSISSIPPI. 


Solution 
The word MISSISSIPPI is the multiset (M, 41, AS, 2P}, where n = 11, 


ng =4, n3=4, т = 2. Therefore, the number of permutations of the word is 


given by: 


n! 11! 
пит 1141412 
| Шх lOx 0x 8x 7хбх5х4х3х2 
4x3x2x4x3x2x2 
11х10х9х7х5 


= 34, 650 


There are 34,650 ways of writing the word MISSISSIPPI. 


Chapter 7 R) 
Probability geu 


7.1 Introduction 


This chapter introduces the subject of probability using examples to create vari- 
ous scenarios, including dependent and independent events, mutually exclusive and 
inclusive events, and the use of combinations. The chapter concludes with a section 
on worked examples. 


7.2 Definition and Notation 


Probability is concerned with quantifying the likelihood of an event occurring. 
The closed interval of probability is [0, 1], where 0 corresponds to an event never 
happening, and 1 to an event always happening. This requires dividing the number 
of ways of securing a successful outcome by the total number of possible outcomes: 


2 number of ways of securing a successful outcome 
probability — 


total number of possible outcomes 


Thus probability does not predict what will happen, but the likelihood of some- 
thing happening based upon the available outcomes. Naturally, probability can be 
expressed as a percentage, which entails multiplying by one hundred. 

Itis assumed in the following examples that coins and dice are unbiased, and that 
decks of playing cards are randomly shuffled. For example, when a coin is tossed, it 
will finish in one of two states: heads or tails. The possibility of a coin landing on 
its edge is so remote that it is ignored. Therefore, the probability of securing a head 
or tail is 1/2 — 0.5, and the sum of the two probabilities is 1. 

The notation is relatively simple: the probability of an event A occurring is 
expressed as: 

P(A), or p(A), or Pr(A) 
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Consider a bag containing 10 balls, with 4 red, 3 green, 2 blue and 1 white. If we 
randomly select a single ball from the bag, the probabilities are: 


4 3 2 : 1 
P (red) = 10 —0.4, P(green) = ae 0.3, P(blue) = 10 —0.2, P(white) = mie 0.1 

Similarly, the probability of getting a 6 when rolling adieis P(6) — 1 = 0.16, and 
is the same for all other numbers. When а red and blue dice are rolled simultaneously, 
there are 36 possibilities, and the probability of rolling two 6s is the product of the 
individual probabilities: 


P(two 6s) = — x 
Table 7.1 shows all possible number outcomes. 

Naturally, all pairs of identical numbers have a probability of x. however, the 
other number combinations are repeated twice, and their probabilities are doubled: 
36 + 36 = 18. 

Therefore, when two dice are rolled together, there are 21 possibilities: 6 pairs of 
identical numbers, each with a probability of $ = 0.027, and 15 pairs of different 
numbers, each with a probability of Б = 0.05. 

We can express this in various ways: The probability of getting a pair of identical 
numbers is 6 x x = i — 0.16, and the probability of rolling a pair of different 
numbers is 15 x x = 15 = 0.83. If the probability of rolling a pair of identical 
numbers is 0.16, then the probability for all other number combinations must be 
1 — 0.16 = 0.83. 

We are now in a position to declare formally that the probability of event A is 
within the interval [0, 1] : 


P(A) є [O, 1] 
and the probability of A not happening as: 


P(A) = 1 — P(A) 


where A is the complement of the set A. 


Table 7.1 Possible number outcomes of rolling а red and blue dice 


ми | 325. ЛА | S| 1,6 
2 да а hc 26 
Е За ||) 3313435136 
4,1 | 4,2 | 4,3 4,4 4,5 4,6 
S: 11552: зз | va 55 156 


6,1 | 62 56,3 64! 62| 6,6 
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7.2.1 Independent Events 


Given two independent events A and B, with individual probabilities P(A) and 
P(B), their combined probability P(A M B) is given by: 


P(AN В) = P(A) x P(B) 


For example, tossing a coin twice are two independent events. The probability of 
getting a head or tail on the first or second toss is 0.5. Even if the coin is tossed 
100 times and lands repeatedly in a head position, the probability for a head or tail 
the next time remains 0.5. However, I would be very suspicious if this occurred! 
Therefore, if we toss a coin twice, the probability of getting two heads is the product 
of the individual probabilities: 1 х i = 1. Naturally, this is the same for tails. The 
other possibilities are heads followed by tails, or tails followed by heads. As both 
of these probabilities are І, their combined probability is their sum: 1. We can also 


say that if the probability for getting both heads is І, or both tails is 1, Шеп Ше 
probability of getting both a head and tail must be 1 — (i + 1) = 1. 
Tossing а coin 3 times creates the following 23 = 8 outcomes: 


HHH, HHT, HTH, HTT, THH, THT, ТІН, ТТТ 


We can see that the probability of getting all heads or all tails is 5; the probability 
of getting two heads and one tail i + i + 1 = 2; and the probability of getting 


8? 
А n "EI 1-1 
alternating heads or tails is $ +; = д. 


7.2.2 Dependent Events 


Two dependent events happen when the first event influences the probability of the 
second event occurring. For example, if a bag contains 10 balls with 3 red, 4 green 
and 3 blue. We pose the question: “What is the probability of selecting a red ball on 
two successive selections, without returning the first ball to the bag?’ 

As there are 10 balls, with 3 red, the probability of selecting a red ball on the first 
selection is P (red;) = 5. There аге now 9 balls left, and the probability of selecting 
a second red ball is Р(те4;) = 2. Therefore, the total probability is their product: 


3 2 6 ВЕ 
P (redi 2) = 10 x == 0.06 


Similarly, the probability of selecting 2 successive green balls is for the first selection 


P(green,) = > and for the second selection P(green,) = 8, making a combined 
probability of: 
Р( ) ig = =e 
reen, 5) = — x ~=— =0. 
кесі) “105097 90 
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The possibilities are: 


3 2 6 - 
P (red;, red?) = 10 x 9 = 90 — 0.06 
3 3 9 
P (red;, blue?) = 10 x 9 = 90 = 0.1 
3 4 12 
P (red;, green,) = To x 9 = 50 = 0.13 
4 3 12 
P(green,, red?) = 10 x 9 = 90 = 0.13 
4 3 12 = 
P(green,, blue?) = To x 9= 90 = 0.13 
4 3 12 
P(green,, green,) = To x 9 = 50 = 0.13 
P (blue), red?) = x 5 = 2 = 0.1 
Р(Ы ) ее 0.13 
ue), green,) = — x = = — =0. 
= ^ 10 9 90 
3 2 6 - 
P (blue;, blue.) = 10 x 9 = 90 — 0.06 


Note that the probabilities sum to 1. 

Dependent events also occur in card games. For example, the probability of select- 
ing a black card from a deck of playing cards is 26. The probability of selecting a 
second black card, without returning the first to the deck is 5. Therefore, the prob- 
ability of selecting two successive black cards is: 


26 25 
P(2 blacks) = 52 x 51 А 0.245 


7.2.3 Mutually Exclusive Events 


Two events are mutually exclusive when they cannot occur at the same time. There- 
fore, given two mutually exclusive events A and B, the probability of both happening 
is zero: P(A U B) = 0. 
The probability of either event occurring is the sum of the probabilities of the 
individual events: 
Р(АПВ) = P(A) + P(B) 


For example, let’s calculate the probability of rolling a dice such that the number is 
a 6 or a prime number. The probability of rolling a six is 1, and the probability of 
rolling a prime number (2, 3, 5) is 2. Therefore, the probability of rolling a six, and 
a prime number is: 
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| : 1 3 4 = 
P (six or prime) = 6 + ІТ 0.6 


7.2.4 Inclusive Events 


Inclusive events occur at the same time and their outcomes overlap. For example, 
given two events A and B with individual probabilities P(A) and P(B), their com- 
bined probability P(A U B) is given by: 


P(AU B) = P(A) + P(B) — P(A N В) 


Returning to the example above with a single die, let’s calculate the probability of 
rolling an even number or a prime number. The probability of rolling an even number 
(2, 4, 6) is 8, and the probability of rolling a prime number (2, 3, 5) is also 8. But 
we can see that both events share the number 2, and the probability of rolling a 2 is 
H, therefore, the answer is: 


Similarly, the probability of rolling an odd number (1, 3, 5) is Š, and the probability 
of rolling a prime number (2, 3, 5) is also 8. But we can see that both events share 
the numbers 3 and 5, and the probability of rolling а 3 or 5 is 2, therefore, the answer 
is: 
208, CO ub „2 = 
+ -ш----0.6 
6 6 6 6 3 


7.2.5 Probability Using Combinations 


Some problems lend themselves to the use of combinations to calculate proba- 

bilities. This occurs especially when order is not important. For example, consider 

the problem of dealing 5 playing cards and calculating the probability of finding 2 

queens, 2 kings and an ace. As order is not important combinations can be used. 
The number of all possible outcomes is the number of arrangements of 5 cards 

from a pack of 52: (2). 

The number of outcomes of choosing 2 queens from a total of 4 is (0). 

The number of outcomes of choosing 2 kings from a total of 4 is P 

The number of outcomes of choosing 1 ace from a total of 4 is (1). 

The probability of finding 2 queens, 2 kings and an ace is: 


O xG) x Q 
(5) 
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where: 
52 52! 
©) = 5471 = 2, 598, 960 
(0-4&- 
17 an! 
(2-3- 
27- 22! 
4 4 4 
0) x Ө x () = $ 56 шы Aem 
5 2 , 


The probability of finding 2 queens, 2 kings and an асе is ~ 0.0000554. 

Let's chose another example. Four men and 6 women have been shortlisted for 4 
scholarships. If the selection process is random, what is the probability that 2 will 
be male and 2 will be female? 

As the question does not mention order, we can use combinations to calculate the 
probability. 

The number of outcomes of choosing 2 males from a total of 4 is 6). 
The number of outcomes of choosing 2 females from a total of 6 is (2); 
The number of all possible outcomes of choosing 4 people from a total of 10 is (2). 
Therefore: 
6 
0) 


0) x 


Р(2 male апа 2 female) = i9 
(4) 
where: 
4 4! 
(2) = (4—2)0! 
_4х3х2 _ 
© 2x2 
6 6! 
(5) = (6 — 2)!2! 
E a ЖЕ, зі 
4х3х 2х 2 
10 10! 
тте тт 
= ОО АВ TOS SOE ОЕА _ м 
6x5x4x3x2x4x3x2 
6 x 15 
P (2 male and 2 female) — “210” А 0.429 


The probability that 2 will be male and 2 will be female, is approximately 0.429. 
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Table 7.2 Possible outcomes and probabilities 


Male Female Outcomes Probability 


0 4 1 22 0.0143 
1 3 16 А 0.229 
2 2 36 А 0.514 
3 1 16 А 0.229 
4 0 1 x 0.0143 


If there had been 4 women instead of 6, we would intuitively expect the probability 
to be 0.5, but it is not: 


0) x 


Р(2 male апа 2 female) = 


where: 


4\ 4 
NET 


4x3x2 
ш------ 0 
2х2 


:)- 8! 
b © (8— 4)M! 


йо 
^ 4x3x2x4x3x2 0 


6x6 
P(2 male and 2 female) = s А 0.514 


This result is due to the small sample and the way probabilities are calculated. 
Table 7.2 shows how probability is divided for different outcomes. 


7.3 Worked Examples 


7.3.1 Product of Probabilities 


How many outcomes are possible with four independent events: tossing a coin, using 
a light switch, rolling a die, and selecting a ball from a bag containing four coloured 
balls? 
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Solution 

There are 2 outcomes when tossing a coin: O(coin) = 2. There are 2 outcomes when 
switching a light: O(switch) = 2. There are 6 outcomes when rolling a die: O(die) = 
6. And there are 4 outcomes when selecting a ball: O(ball) = 4. Therefore, the total 
number of outcomes is: 


O(coin) x O(switch) x O(die) x O(ball) = 2 x 2 x 6 x 4 = 96 


7.3.2 Book Arrangements 


Ihave 3 books on IATEX, 2 books оп С, 1 book on Java, and 2 books on Python. How 
many ways can these different books be arranged in my bookcase? 


Solution 
There are 8 books in total; the first book can be one of 8, the second book can be one 


of 7, etc., therefore the number of arrangements is: 


8!=8x7x6x5x4x3x2= 40, 320 


7.3.3 Winning a Lottery 


Calculate the probability of winning a lottery where the successful winner has a 
six-digit number that has no leading zeros. 


Solution 
The first digit can be 1 to 9, and the remaining five digits can be 0 to 9, therefore the 
probability of winning is: 


1 tz 
— — =o. 1 
900,000 = 900000 


7.3.4 Rolling two Dice 


Calculate the probability of rolling a 2 and a 3 with two dice. 


Solution 
The two dice are independent of one another. The probability of rolling a 2 and 3 is: 


1 1 В 
P(Q and3) = -x = — = 0.027 
C= ге 


ale 
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Similarly, the probability of rolling a3 and 2 is also x . Therefore, the total probability 


18: 
m E d = 0.05 
36 36 18 ` 


7.3.5 Two Dice Sum to 7 


Calculate the probability of rolling two dice that sum to 7. 


Solution 
There are 6 ways the two dice can total 7: 


LB 2,5, 3,4, o3 5,2, 6.1 


The probability of any one of the above outcomes is E , therefore the total probability 


is: 6 і 
== = = = 0.16 
36 6 


7.3.6 Two Dice Sum to 4 


Calculate the probability of rolling two dice that sum to 4. 


Solution 
There are 3 ways the two dice can total 4: 


122, 4. WS 


The probability of any one of the above outcomes is x , therefore the total probability 


1S: 3 1 
Il = — = 0.083 
36 12 


7.3.7 Dealing a Red Ace 


Calculate the probability of dealing a red ace from a deck of playing cards. 


Solution 
There аге 52 cards in a full deck, and 2 red aces, therefore, the probability of dealing 


ared ace is: 
2 = : А 0.0385 
55 6% 7 
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7.3.8 Selecting Four Aces in Succession 


Calculate the probability of selecting four aces in succession from a deck of playing 
cards without replacement. 


Solution 

There are 52 cards in a full deck, and 4 aces. The probability of selecting the first ace 
18 2. The probability of selecting a second ace is a. The probability of selecting a 
third ace is 2. . The probability of selecting а forth ace is 5 . The combined probability 
is: 

А: 0.0000037 


4 3 2 1 24 
P (4 aces) = x x x = 
52 51 50 49 6,497,400 


7.3.9 Selecting Cards 


Calculate the probability of selecting an ace or a king from a deck of playing cards 
without replacement. 


Solution 
There are 52 cards in a full deck, with 4 aces and 4 kings. The probability of selecting 


an ace or a king is: 


8 2 
== = — А 0.154 
52 13 


7.3.10 Selecting Four Balls from a Bag 


A bag contains 14 balls with 5 red, 6 green and 3 blue. Four balls are taken from 
the bag in one go; calculate the probability of selecting 2 red balls, and 2 non-red balls. 


Solution 
As order is not important, use combinations to compute the different possibilities. 
There are G) ways of selecting 2 balls from 5 red balls. 


There are (2) ways of selecting 2 balls from the 9 non-red balls. 
Therefore there are 6) х () ways of selecting 2 red balls and 2 non-red balls. 


There are (1% ways of selecting 4 balls from 14 balls. 
Therefore, the probability of selecting 2 red balls and 2 non-red balls is: 
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where: 


9V 9 9x8 
иы das | ды 
2] >! 2 


К NBC МН | ogy 
4) 4110! — 4x3x2 БЕ 
С) x (5) _ 10x 36 0.36 

(19) 1,001 | 


7.3.11 Forming Teams 


Teams of 5 are formed randomly from 8 female and 7 male students. What is the 
probability that a team contains 2 male and 3 female students? 


Solution 
As order is not important, use combinations to compute the probability. 


There are (2) ways of forming the teams. 
There are (2) ways of selecting 2 males from a total of 7. 


There are (5 ways of selecting 3 females from a total of 8. 
The probability that a team contains 2 male and 3 female students is: 


where: 


7 7! 7x6 
= = 21 
2) 25 2 


8 8 8х7 
E x DOT Боа дд 
3 E 3x2 
15 5! 15x 14x I3x 12x 11 
ү = = 5х4х3х2 E 


()x( 21x56 


(С) 3.00 “20892 
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7.3.12 Dealing Five Cards 


Five cards are taken from a deck of playing cards. What is the probability that the 
five cards contain 2 clubs and 3 diamonds? 


Solution 

As order is not important use combinations to compute the probability. 
There are (2) ways of selecting 5 cards from 52 cards. 

There are (5) ways of selecting 2 cards from 13 club cards. 

There are (5) ways of selecting 3 cards from 13 diamond cards. 

The probability that the five cards contain 2 clubs and 3 diamonds is: 


(2) x (5) 


where: 
52 52! _ 52 x 51 x 50 x 49 x 48 
T = 2; Ы 
ЭБ 5!47! 5x4x3x2 928260 
13 13! 13 x 12 
(2) =з 2111 2 
13 13 x I2 x 11 
= = EU шош 
= 3110! 3x2 
(5 
T 2 
(2) x on 8 x 286 0.00858 
С) ros 598, 960 


The probability that the five cards contain 2 clubs and 3 diamonds is ~ 0.00858. 


Chapter 8 A) 
Statistics б 


8.1 Introduction 


This chapter revises some statistical techniques and concepts, with which, the reader 
should be familiar. It covers data sets, frequency tables, stem and leaf diagrams, 
frequency histograms, scatter diagrams, mean, median, mode, range, variance, stan- 
dard deviation, normal distribution, normal distribution tables, covariance, correla- 
tion, regression and the least squares fit. The chapter concludes with some worked 
examples. 


8.2 What Is Statistics? 


Statistics comprises a collection of mathematical techniques associated with data 
sets belonging to populations. Such populations may be complete or sampled, but 
in either case, we are interested in drawing a conclusion about the population, and 
if possible, knowing the degree of confidence associated with this conclusion. If a 
population is sampled, we must expect any conclusion to vary randomly from sample 
to sample. 


8.3 Presentation of Data 


The presentation of data depends upon the nature of the data. For example, the 
use of frequency tables, stem and leaf diagrams, frequency histograms, and scatter 
diagrams, are best used for particular types of data. These are described next, and 
are intended to complement the standard graphical techniques. 
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8.3.1 Data Sets 


Before covering the ways of displaying data, a brief word about data sets. A data 
set is nothing more than a set of data, either the population itself, or a sample of the 
population. I will tend to employ set notation when defining a data set. 


8.3.2 Frequency Table 


A frequency table displays how many times a particular feature of a data set occurs. 
For example, say the weight (kg) of similar objects is recorded as follows: 


W=105 104 100 105 105 102 103 100 105 100 102 
100 104 105 103 101 102 100 102 103 104 104 


or using set notation: 


W = (105, 104, 100, 105, 105, 102, 103, 100, 105, 100, 102, 
100, 104, 105, 103, 101, 102, 100, 102, 103, 104, 104} (8.1) 


A frequency table of (8.1) is shown in Table 8.1, where each column records the 
number of times a particular number occurs. 

Next, we consider another data set concerned with the mark out of 100, for a 
group of students studying computer science. The set S of marks are as follows: 


S = (57, 80, 72, 56, 70, 50, 49, 83, 81, 69, 56, 63, 62, 41, 44, 59, 56, 64, 73, 69, 


55, 64, 60, 44, 72, 70, 69, 66, 61, 62, 83, 68, 59, 68, 71, 66, 39, 45, 53, 55] 
(8.2) 


A frequency table of (8.2) is shown in Table 8.2. In both examples, the very nature 
of the table, hides a feature of the data. 


Table 8.1 A frequency table of (8.1) 


x | 100 101 102 | 103 104 105 


fs 1 4 3 4 5 


Table 8.2. A frequency table of (8.2) 


x | 30 | 40 | 50 | 60 | 70 | 80 


Тр 1 5 10 | 14 | 6 4 
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8.3.3 Stem and Leaf Diagram 


A stem and leaf diagram records 13 as two elements: stem | leaf, i.e. 1 | 3. Sucha 
diagram of (8.1) is shown in Table 8.3. The diagram does not lend itself to the data 
set, and does not add anything to the frequency table. 

A similar diagram of (8.2) is shown in Table 8.4, which is much more useful, 
perhaps because the data are sorted. 


6.3.4 Frequency Histogram 


А frequency histogram shows the data set as a histogram. Figure 8.1 shows a 
frequency histogram of (8.1), which for me, communicates the data set. 

A frequency histogram can be drawn in different ways, and in each way would 
reflect the nature of the original set of data. 


Table 8.3 A stem and leaf diagram of (8.1) 


10 |0 0 0 0 0 
10 | 1 

je 22.2 2 
10|3 3 3 

10|4 4 4 4 


ПО ӨР 5865, 


Table 8.4 А sorted stem and leaf diagram of (8.2) 
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Fig. 8.1 А frequency 
histogram of (8.1) 


Fig. 8.2 A scatter diagram 


8.3.5 Scatter Diagram 


A scatter diagram is a diagram linking an independent variable with a dependent 
variable. For example, Fig. 8.2 shows a scatter diagram where there is an approximate 
linear relationship between x and y. 


8.4 Mean, Median, Mode and Range 


The mean, median and mode generate a single number to represent a data set. This 
number attempts to show something about the data, which generally decides which 
one should be used. 
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8.4.1 Mean 


There are two types of mean: population mean and sample mean. When using the 
data set for the entire population, we use the following formula: 


1 
и = = Xi 
х2; 


where № is the size of ће population. The population mean of the 40 computer 
science students is: 


1 40 
и = 2-5 
B 2484 


~ 40 
= 62.1 


which is in the row beginning with 6 in the sorted stem and leaf Table 8.4. 
When presented with a sample of a data set taken from a population, we call it x 
instead of и: 


where: 


x; is a member of the data set data set. 


n is the number of sample elements. 
The mean of the sample 3, 4, -2, 0, 5, 4 is calculated as follows: 


3-4-2-0-5-4 14 
6 © 6 


=23 


x= 


Remember that У is a ‘capital sigma’ and stands for the ‘the sum of". 
Using a frequency table, where the frequency of a particular number is referenced, 
as in Table 8.1, the population mean is evaluated as follows: 
5 x 100 + 101+4 x 102 +3 x 103 + 4х 104 + 5 x 105 
"m 55144434445 
500 + 101 + 408 + 309 + 416 + 525 
22 


_ 2259 
© 22 
де 102.68 
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8.42 Median 


The median of a data set is the middle value separating the two equal data sets. 
Naturally, this is a particular value for an odd data set, but has to be halved for an 
even data set: 


п is odd, median(n) = x(n+1)/2 


X(n/2) + Х((п/2)+1) 


n is even, median(n) = 5 


For example, if is 5, and the datais 6, 7, 8, 10, 12, the median is 8. Whereas, if n 


is 8, and the data are 2, 4, 7, 8, 12, 16, 20, 22, the median is #4 = 10. Notice 
that before starting calculating the median, the data must be sorted. 


8.4.3 Mode 


The mode of a data set identifies the most frequent value of the data set. 
Figure 8.3 shows the mode, median and mean for an imaginary data set. Data may 
be such that there is no mode, or more than two. 


8.4.4 Range 


The range of a data set X is given by: 


К = Xmax — Xmin 


Fig. 8.3 The difference 
between mode, median and 5 
mean 


Mode Mean Median 


50% 
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For example, given the data set X = (6, 8, 12, 16, 20}, where Xmax = 20 and 
Xmin = 6, and R = 14. 


8.5 Random Variable 


A random variable is a quantity that cannot be defined with any certainty, and per- 
haps, is best defined using a probability distribution. Random variables can be divided 
into two groups: discrete and continuous, either possessing their own probability dis- 
tribution. Discrete distributions arise from counting exercises. For example, in the 
recent Covid-19 epidemic, data was generally, made available every day showing 
the accumulated deaths on a country-wide basis. Such data is described as discrete 
as it was accrued by counting the number of people dying every day. Continuous 
distributions arise from experiments where something is monitored, like a smart 
gas or electricity meter, which displays the consumption of gas and electricity for a 
particular period. For the moment, we'll only consider discrete random variables. 


8.6 Variance 


Let's assume a discrete random variable X with values ху, хо, хз,..., Хи. For X to 
take on a particular value is called the expected value or expectation of X, and 
equals the mean of X. It is written E[ X], and defined as: 


1 +x +з ++ 
ne uoce pem 
і-і 


where E[X] = д. 
There are two ways of defining variance of a random variable X. The first is 
equal to the mean of the square of X minus the square of the mean of X. i.e. 
Var(X) = E[X?] — ХР (8.3) 


Equation (8.3) is also written: 


Var(X) = E[(X — Е ХУ? (8.4) 
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Table 8.5 A table showing x — и and (x — и)? for each x 


x 30 36 42 40 44 48 


fil -10 | 4 2 0 4 8 


((-ш2 100 16 4 0 16 64 


[ 
= E[X? — 2XE[X]+ Е ХР] 
= E[X?] — 2E[X]E[X] + Е[Х]? 
= E[X?] - ХР 


For example, using (8.4) where X = {30, 36, 42, 40, 44, 48}. The expected mean 
of X is: 


Next, we compute x — u and (x — u)? for each member of X in Table 8.5. The 
variance of X is: 


6 
1 2 
Var(X) = c 2 (Q8 — и) 
i=l 


= 100+ 164-440 16+ 64 


Next, let’s show that using (8.3) gives the same result. 
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Table 8.6 А table showing x and x? 


% 30 36 42 40 44 48 


x? | 900 | 1296 | 1764 | 1600 | 1936 2304 


Table 8.6 shows x and x?. Taking the mean of X? we have: 


TE 900 + 1296 + 1764 + 1600 + 1936 + 2304 
6 
= 1633.3 
E[XP = и? = 40? = 1600 
E[X?] — E[X]? = 1633.3 — 1600 = 33.3 


which agrees with the value of variance computed with Var(X) = E[(X — E[X )?]. 
Variance is also known by 82, o? and V(X). o is ‘lower-case sigma’. 
The second definition of variance relies on the definition of standard deviation, 
and is defined as follows: the variance of the random variable X is the expected value 
of the squared deviation from the mean of X: 


Var(X) = E[(X — E[X]Y?] 


where E[X] = и, which is Eq. (8.4). 


8.6.1 Variance with a Probability Distribution 


To calculate the variance of a discrete random variable with a probability function, 
we incorporate the probability function with our definition of variance, and leave out 
the multiplication by 1/n: 


Var(X) = 9 pi» Gi и)? 
i-i 


where p; is the probability function whose sum is 1, jz is the expected value for the 
mean of X, x; is a value of X,n is the number of values in X and p. 

For example, let's calculate the variance of the value of rolling a fair six-sided 
die. We know from our definition of the die that the probability function that: 


fi 2 x o* 43 
Га Е 
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136 
and 
1 
= 1+2+3+4+5+6 
=) (0 +2+3+4+5+6) 
_ 21 
E 
E 
E 
therefore: 


1 5\7 з\? y? AVY BV 5V 
76 Jk Gio (5) +(3) 
1725 1 25 
- [2+ eat +3 
1 70 
= т 
= 2.916 


The variance of rolling а true die is approximately 2.92 (https://en.wikipedia.org/ 


wiki/Variance). 
Let’s imagine that the die is faulty, and never shows a 3, but a 6 instead. How does 


this affect the variance? 
To begin with, we must calculate a new expected value of X: 


1 
м-:5224%2%6%4%5-6 


_ 24 
© 6 
=4 


Next, we need a new probability function, one that reflects the untrue die: 
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therefore: 


6 
Var(X) = Уре: (к — 4)? 
1=1 


Гаа Ту ае Т. 
[оез «gem eoa? ола сауа 207] 


9 4 1 8 
[нонок 
22. 

6 


= 3.6 


The variance of rolling а faulty die is approximately 3.67. 


8.7 Standard Deviation 


The standard deviation is a measure of the dispersion of the sampled data. A low 
standard deviation indicates that the data are close to the mean, whereas a high 
standard deviation indicates that the data are away from the mean. The standard 
deviation of a random variable X is given by: 


о = Var(X) 


For the data set shown in Table 8.6 where Х = {30, 36, 42, 40, 44, 48}, and the 
variance equals 33.3, the standard deviation is the square-root of 33.3, and equals 
approximately 5.77. 

The standard deviation is written: 


1 n 
zu |. .— uy 
c 526 и) 


where u is the expected mean of X, x; is a value in X and n is the number of values 
in the data set. 

Note that the squaring operation used in computing the variance, ensures that it 
is always positive. 

As an illustration of standard deviation, consider the dataset X = (1, 4, 5, 4, 1}, 
whose expected mean equals u = 3. The variance of X is computed using Table 8.7. 
The numbers in X have been chosen to make и = 3. 
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Table 8.7 A table showing x — и and (x — и)? for each x 


% ТОА Е 


х-и — | Jl | 2 | il | 2 


к= | & |i |a| i | 4 


Table 8.8 A table showing y — шапа (y — и)? for each y 


y 3 aka) |e) ФЕ 3 


sese В Ires Ped eo т 


(у= и)? | 1 | 0.25 |1 | 0.25 | 1 


The standard deviation of X: 


Ee 
z 


the standard deviation of X is approximately 1.67. 
Now let's consider the data set Y = (3, 4.5, 5, 4.5, 3}, whose expected mean 


equals u = 4. The values of Y have been chosen to make u = 4. 
The variance of Y is computed using Table 8.8. 
The standard deviation of Y: 


D 
1 
a= |=) 01-4 
i=l 


ЕЕ 
Е 5 


220 


5 
а — н)? 


касс 


^ 0.8367 


the standard deviation of Y is approximately 0.8367. 
Thus a data set with values close to the mean, has a lower standard deviation. 
Please note that the above examples form the complete population, and it is normal 
to divide by n, the number of data points. When the data are taken randomly from a 
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complete population, one divides by n — | to give what is called an unbiased estimate 
of the variance. Readers wishing to know more about this subject should investigate 
(https://en.wikipedia.org/wiki/Standard deviation). 


8.8 Normal Distribution 


A normal distribution is a mathematical function that shows the characteristics 
of some populations. A variety of ways exist to generate these curves, but the one 
we should know about is the Gaussian distribution (https://en.wikipedia.org/wiki/ 
Normal distribution): 


Е 2 
О) = rage (8.5) 


g 


Equation (8.5) is known as a Gaussian distribution. 
The equation for the curve shown in Fig. 8.4 is (8.6) and is known as the standard 
normal distribution or unit normal distribution: 


e 2 


КУЯ 


which peaks at 0.4, and is +1 standard deviation from the mean when +x = 1, and 
+2 standard deviations from the mean when -Ех = 2, etc. It has a total area of 1, and 
is used to solve probability problems where it is assumed that the data are normally 
distributed. ф stands for ‘phi’, which is often written as ф. 

Approximately 6896 (2 x 34.1) of values drawn from a standard normal distribu- 
tion are within 1с away from the mean; approximately 95% (2 x 47.7) of values 
are within +20; and approximately 99.7% (2 x 49.8) of values are within +30. 

For example, a set of students studying computer science are normally distributed 
with a mean mark of 65% and a standard deviation of 10%. Let a group of students 


У 


ф(х) = (8.6) 


Fig. 8.4 А Gaussian Т 
function | | 


04 


34.1% 


13.6%} 2.1% 
0 1 2 3 4 5 
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chosen randomly from the set. What is the probability of the group of students having 
a mark less than 5596, and greater than 8596? 

Knowing that the set of students is normally distributed permits us to solve the 
problem. 55% is one standard deviation from the mean of 65%, which represents 
34.1% of the data. As the total area of the standard normal distribution equals 1, there 
must be 15.9% beneath the first standard deviation, therefore the answer is 0.159. 
85% is two standard deviations above the mean, there must be 2.3% above the first 
two standard deviations, therefore the answer 1s 0.023. 

Let's find for the same group of students the probability of exceeding 65% but 
less than 75%. 75% is one standard deviation above the mean, which covers 34.1% 
of the standard normal curve, therefore the answer is 0.341. 


8.8.1 Normal Distribution Table 


A Normal Distribution Table provides the area of the Standard Normal Distribu- 
tion curve left of the central mean, for solving problems where fractional values of 
standard deviations (https://www.math.arizona.edu/~jwatkins/normal-table.pdf). 

Table 8.9 shows such a table, but only includes halves of standard deviations. For 
example, a set of students studying computer science are normally distributed with a 
mean mark of 65% and a standard deviation of 10%. Let a group of students chosen 
randomly from the set. What is the probability of the student group has a mark less 
than 5096? 

Knowing that the set of students is normally distributed permits us to solve the 
problem. 50% is 1.5 standard deviations from the mean of 65%, and looking up Table 
8.9 shows that the entry for —1.5 is 0.0668. Therefore the answer is 0.0668. 


Table 8.9 A Standard Normal Distribution Table 


Z .00 :01 :02 .03 04 05 06 :07 :08 09 


—3.5 | .0002 .0002  .0002 .0002  .0002  .0002 .0002  .0002  .0002 .0002 


—3.0 .0014 0013  .0012  .0012  .0012  .0011  .0011  .0011  .0010 .0010 


—2.5 | .0062  .0060 .0059 .0057 .0055  .0054  .0052  .0051  .0049 .0048 


—2.0 | .0228  .0222  .0217  .0219  .0207  .0202  .0197  .0192  .0188 .0183 


—1.5 | .0668  .0655  .0643  .0630  .0618 .0606  .0594  .0582  .0571 .0559 


—1.0 | .1587 1563 15399 .1515  .1492 1469  .1446  .1423  .1401 11379 


—0.5 | .3085  .3050 .3015 .2981 .2946 22912  .2877  .2843  .2810 2.2776 


—0.0 | .5000 4960 4920  .4880  .4841 4801  .4761  .4721  .4681 .4641 
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8.9 Covariance and Correlation 


8.9.1 Covariance 


Covariance in statistics is a measure of how much two random variables vary 
together, whereas variance measures how much a single variable varies. Figure 8.5 
shows three types of covariance. A large negative covariance results when a small 
value in X corresponds with a high value in Y, and a high value in X corresponds 
with a small value in Y. This results in points having a negative slope. A large pos- 
itive covariance results when a small value in X corresponds with a small value 
in Y, and a high value in X corresponds with a high value in Y. This results in 
points having a positive slope. A near zero covariance results when there is no cor- 
respondence between X and Y (https://www.statisticshowto.com/probability-and- 
statistics/statistics-definitions/covariance/). 
Covariance is defined for discrete random variables by: 


1 n 
Cov(X, Y) = - УЖЕ = их)(у — Hy) 


i=1 


where X and Y аге the discrete random variables, их is the expected mean of X, uy 
is the expected mean of Y, x; is a value in X, y; is a value in Y, and 1/n is the equal 
probability in x; or yj. 
For example, to calculate the covariance of X = (2,2.5,3.5, 4) апа У = 
(8, 9, 10, 13]. 
The expected mean of X is их = 3, and the expected mean of Y is uy = 10. 
Next, we compute x — их and y — wy foreach member of X and У, in Table 8.10. 
Finally, we sum (x — их)(у — иу) and divide by 4: 


2405-043 5.5 
4 E! 


Cov(X, Y) = = 1.375 


Large negative Near zero Large positive 


covariance between covariance between covariance between 
X and Y X and Y X and Y 


Fig. 8.5 Three types of covariance 
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Table 8.10 A table used to calculate the covariance of X and Y 


% 2 289 3.5 4 

у 8 | 9 | 10 | 13 

xX — их -1 -05 | 0.5 1 

У- Му -2 -1 0 3 

(х = их)(у = иу) | 2 | 05 0 3 


The covariance is positive, therefore X and Y are positively related, with a value of 
1.375. 

One should be careful computing covariance as its value is influenced by the axis 
units. For example, say we want to find the relationship between some goods and 
their prices. We could compute the covariance using pounds or pence, and get two 
different values. 


8.9.2 Correlation 


Correlation determines whether two random variables are linearly related, and per- 
haps the most useful quantity is Pearson’s product-moment correlation coefficient 
(PPMCC), also called the correlation coefficient. 
The English mathematician Karl Pearson (1857-1936), is credited with creating 
the discipline of mathematical statistics and the coefficient, which bears his name. 
The population correlation coefficient px y between two random variables X and 
Y is defined as: 


Cov(X, У E[(X — Y – А 
дебату ov(X, Y) _ EIX — их)( by) fever 2i 
OxOy OxOy 
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Table 8.11 А table for calculating the standard deviation of X and У 


x 2) 2.5 3.5 4 


y 8 9 10 | 13 


х= их -І | -0.5 0.5 1 


where: 


их is the expected mean of X. 
Мү is the expected mean of У. 
Ox is the standard deviation of X. 


oy is the standard deviation of Y. 


Let's find the correlation coefficient of X = (2, 2.5, 3.5, 4} and Y = (8,9,10, 13} 
in the previous example where Cov(X, Y) — 1.375, and the expected mean for 
их = З апа му = 10. Next, we must find the standard deviations oy and oy. 

The standard deviation of X using Table 8.11 is calculated as follows: 


4 
1 
ox= |4 (а =o) 


ЕТ сс 
B 4 


А 0.79 


The standard deviation of Y using Table 8.11 is calculated as follows: 


1 4 
ov = 42,0: - 19? 


the standard deviation of X and Y is approximately 0.79 and 1.87 respectively. 
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Table 8.12 A table showing calculations for Cov(X, Y) and the standard deviation 


x 1 2) 3 4 

у 4 0 3 1 
х= их -15 | -0.5 0.5 1.5 
y — Шу 2 -2 1 —1 


(x — их)(у — иу) 3 1 0.5 | —1.5 


(x — их)? 2.25 | 025 025 2.25 


(у = шу)? 4 4 1 1 


Dividing 1.375 by ox = 0.79 and oy = 1.87 we have: 


1.375 
PXY = 979 x L87 ^ 979 


therefore, the correlation coefficient px y ^: 0.93, which confirms that X and Y are 
related. 

Let's calculate the correlation coefficient for two random variables that are almost 
unrelated: X = (1, 2, 3, 4} and Y = (4, 0, 3, 1} using Table 8.12 as follows: 


Cov(X, Y) : У ( T ) 23 _ 075 
M = = i = = = —0. 
ov(X, 42- xi = их): — Шүу 1 


The standard deviation of X: 


the standard deviation of X is approximately 1.12. 
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The standard deviation of Y: 


1-4 
oy = | У = шу)? 


the standard deviation of У is approximately 1.58. Therefore, the correlation coeffi- 


cient is: 
—0.75 


а ДОД 
1.12 х 1.58 


which is almost unrelated. 


8.10 Regression 


Regression analysis is a statistical technique where we attempt to discover the degree 
of association between two variables. For example, we may perform an experiment 
which generates values of a dependent variable from values of a set of independent 
variables. This results in a set of values that may have an underlying association that 
links the values together with a straight line, a quadratic curve, or a higher degree 
polynomial. 

Simple regression analysis permits us to create a straight line equation, which, 
within reason, allows us to predict how the model behaves for different combination 
of values. Such an equation, creates a regression line linking the two variables 
y = f(x), where x is the independent variable, and y is the dependent variable. 
Consequently, the regression line represented by у = f (x) is known as a regression 
line of y on x. 


6.10.1 | Least Squares 


Figure 8.6 shows points Ру, P» and P, that are the result of some experiment. The 
object is to draw a straight line with equation y = mx + c, such that it minimises 
the square of the vertical lines between апу two points P, and О,. Consequently, 
the technique is called the line of least squares. For example, the point Qı has 
coordinates (хі, mx, + c), Q2 has coordinates (xo, mx» + c), and for any point 
О,(х,, тх, + с). The distances of the lines joining any two points аге: 
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Pa [a n: Уп ) 


у = тт + С 


Fig. 8.6 А diagram for least squares 


У1-С- тх, yo—c—mxo, y3—C—MX3,..., Yn — € — тх, 


The square of these distances is taken to ensure that they are always positive. 

Partial differentiation is used to solve this problem, and those readers wishing to 
investigate this should look at (https://en.wikipedia.org/wiki/Least squares). For us, 
the solution to the least squares problem is: 


(их, шу) = (mean of x, mean of y) 


y=mx+c 
IX». EXE 
m n n n 


Sry _ Соу(Х, Ү) 
о2 Var(X) 


where у = mx + с is the straight line equation of the least squares, Cov(X, Y)/ 
Var(X), and с = uy — тих. 

Let’s test this with an example. Table 8.13 shows the relevant computations, there 
are 4 points with coordinates: 


X = (1, 3, 5, 7} 
У = {0.5, 3, 2, 4} 


их = 4 
5.625 + 0.625 + 0.375 + 4.875 
Cov(X, У) = T 1 T = 2.875 
9+1+1+9 
Var(X) = НЕЕ =5 


4 
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Table 8.13 A table for calculating Cov(X, Y) and Уаг(Х) 


Я 1 3 5 7 

y 0.5 3 2) 4 
х= их -3 —1 1 3 
У = Шү —1.875 | -0.625 0.375 1.625 


(х= их)(у = py) | 5.625 0.625 0.375 4.875 


(x — ux) — их) 9 1 1 9 


to 
о 


Qo үү d$ $ ү ү” 


=} 


Fig. 8.7 A straight line through 4 points using least squares 


2.87 
m= = = 0.575 


c = 2.375 — 0.575 x 4 = —0.075 
у = 0.575х — 0.075 


Figure 8.7 shows the original 4 points, and the best fit equation, у = 0.575x — 0.075. 
The Worked Example reverses the y-coordinates. For а complete discussion on this 

algorithm refer to (https://www.ncl.ac.uk/webtemplate/ask-assets/external/maths- 

resources/images/Regression_and_Correlation.pdf). 


8.11 Summary 


Statistics is a very large subject, and hopefully this short chapter has given you a 
flavour of the subject. 
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Table 8.14 A frequency table of X 
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x | 200 


201 


202 


203 


204 


205 


206 


F 4 


2 


4 


4 


8.12 Worked Examples 


8.12.1 Frequency Table 


Use a frequency table to display X. 


Solution 


Table 8.14 shows a frequency table of X. 


8.12.2 Stem and Leaf Diagram 


X = {206, 203, 200, 204, 206, 201, 203, 200, 205, 200, 202, 
201, 205, 204, 203, 202, 202, 200, 203, 204, 205, 202} 


Use a stem and leaf diagram to display a sorted Y. 


У = {56, 79, 73, 55, 71, 49, 51, 82, 81, 68, 57, 64, 60, 44, 46, 58, 54, 62, 75, 68, 
53, 66, 59, 46, 71, 70, 67, 68, 59, 63, 81, 69, 57, 69, 69, 68, 38, 46, 51, 55} 


Solution 
Sort Y: 


У’ = (38, 44, 46, 46, 46, 49, 51, 51, 53, 54, 55, 55, 56, 57, 57, 58, 59, 59, 60, 62, 
63, 64, 66, 67, 68, 68, 68, 68, 69, 69, 69, 70, 71, 71, 73, 75, 79, 81, 81, 82) 


Table 8.15 shows a sorted stem and leaf diagram of Y. 
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Table 8.15 А sorted stem and leaf diagram of Y 


Fig. 8.8 А frequency 
histogram of Y 


8.12.3 Frequency Histogram 


Use a frequency histogram to display Y. 


У = {56, 79, 73, 55, 71, 49, 51, 82, 81, 68, 57, 64, 60, 44, 46, 58, 54, 62, 75, 68, 
53, 66, 59, 46, 71, 70, 67, 68, 59, 63, 81, 69, 57, 69, 69, 68, 38, 46, 51, 55} 


Solution 
Sort Y: 


У’ = (38, 44, 46, 46, 46, 49, 51, 51, 53, 54, 55, 55, 56, 57, 57, 58, 59, 59, 60, 62, 
63, 64, 66, 67, 68, 68, 68, 68, 69, 69, 69, 70, 71, 71, 73, 75, 79, 81, 81, 82) 


Figure 8.8 shows a frequency histogram of Y. 


150 8 Statistics 


8.12.4 Mean 


Calculate the mean of Y. 


У = {56, 79, 73, 55, 71, 49, 51, 82, 81, 68, 57, 64, 60, 44, 46, 58, 54, 62, 75, 68, 
53, 66, 59, 46, 71, 70, 67, 68, 59, 63, 81, 69, 57, 69, 69, 68, 38, 46, 51, 55} 


Solution 


1 2 
и = 2-7 
_ 2423 


2240 
= 60.575 


The mean of Y is approximately 60.58. 


8.12.5 Median 


Calculate the median of Y. 


У = (56, 79, 73, 55, 71, 49, 51, 82, 81, 68, 57, 64, 60, 44, 46, 58, 54, 62, 75, 68, 
53, 66, 59, 46, 71, 70, 67, 68, 59, 63, 81, 69, 57, 69, 69, 68, 38, 46, 51, 55} 


Solution 
Sort Y: 


У’ = (38, 44, 46, 46, 46, 49, 51, 51, 53, 54, 55, 55, 56, 57, 57, 58, 59, 59, 60, 62, 
63, 64, 66, 67, 68, 68, 68, 68, 69, 69, 69, 70, 71, 71, 73, 75, 79, 81, 81, 82) 


There are 40 numbers in Y, which is even. Therefore: 


Yao + Уор 
2 
62+ 63 
Do 0 
= 62.5 


median(Y’) = 


The median of Y is 62.5. 
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8.12.6 Mode 


Calculate the mode of Y. 


У = (56, 79, 73, 55, 71, 49, 51, 82, 81, 68, 57, 64, 60, 44, 46, 58, 54, 62, 75, 68, 
53, 66, 59, 46, 71, 70, 67, 68, 59, 63, 81, 69, 57, 69, 69, 68, 38, 46, 51, 55} 


Solution 
Sort Y: 


У’ = (38, 44, 46, 46, 46, 49, 51, 51, 53, 54, 55, 55, 56, 57, 57, 58, 59, 59, 60, 62, 
63, 64, 66, 67, 68, 68, 68, 68, 69, 69, 69, 70, 71, 71, 73, 75, 79, 81, 81, 82) 


The mode of Y is 68. 


8.12.7 Range 


Calculate the range of Y. 


Y = (56, 79, 73, 55, 71, 49, 51, 82, 81, 68, 57, 64, 60, 44, 46, 58, 54, 62, 75, 68, 
53, 66, 59, 46, 71, 70, 67, 68, 59, 63, 81, 69, 57, 69, 69, 68, 38, 46, 51, 55] 


Solution 
Sort Y: 


Y' — (38, 44, 46, 46, 46, 49, 51, 51, 53, 54, 55, 55, 56, 57, 57, 58, 59, 59, 60, 62, 
63, 64, 66, 67, 68, 68, 68, 68, 69, 69, 69, 70, 71, 71, 73, 75, 79, 81, 81, 82) 


The range of Y is 82 — 38 — 44. 
8.12.8 Variance 


Calculate the variance of X. 


X = {50, 55, 56, 71, 73, 79} 
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Table 8.16 А table showing the calculations for variance 


x 50 | 55 | 56 | 71 | 73 79 


X 14 9 8 | 7 11 15 


(х-ш2 | 196 | 81 | 64 | 49 | 121 | 225 


Solution 


The mean of X is 64. 
Next, we compute x — и and (х — ш)? for each member of X in Table 8.16. The 


variance of X is: 


6 

І 2 

Уаг(Х) - 6 ] (x; — и) 
1=1 


196 + 81 + 64 + 49 + 121 + 225 
6 


736 
6 
122.6 


The variance of X is approximately 122.67. 


8.12.9 Standard Deviation 


Calculate the standard deviation of X. 


Х = {52, 55, 56, 58, 61, 66} 
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Table 8.17 А table showing the calculations for standard deviation 


x 52 | 55 | 56 | 58 61 66 


ее 


((-шШ2 36194 0 9 64 


Solution 


The expected mean of X is 58. 
Next, we compute x — u and (x — р)? for each member of X in Table 8.17. 


6 

І 2 

Var(X) = 6 у (xi — и) 
i=l 


36494440494 64 
= 6 

ЕШ); 

726 

= 203 


The variance of X is 20.3, and the standard deviation is approximately 4.51. 


8.12.10 Normal Distribution 


A set of students studying maths are normally distributed with a mean mark of 60% 
and a standard deviation of 10%. Let a group of students chosen randomly from the 


set. What is the probability of the group of students having a mark less than 50%, 
and greater than 80%? 


Solution 
50% is one standard deviation from the mean of 60%, which represents 34.1% of 
the data. As the total area of the standard normal distribution equals 1, there must 


154 


Table 8.18 A table showing the calculations for covariance 


a 2) 25 3.5 4 

b 13 10 9 8 

а — МА -1 -0.5 | 05 1 

b— ив 3 0 = | 2 

(а— uA)(b — ив) | -3 0 | Os | 2 
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be 15.9% beneath the first standard deviation, therefore the answer is 0.159. 80% 
is two standard deviations above the mean, there must be 2.3% above the first two 


standard deviations, therefore the answer is 0.023. 


8.12.11 Normal Distribution Table 


A set of students studying maths are normally distributed with a mean mark of 
60% and a standard deviation of 10%. Let a group of students chosen randomly from 
the set. What is the probability of the group of students having a mark less than 5596? 


Solution 


55% is one-half standard deviation from the mean of 60%, and looking up Table 8.9 


shows that the entry for —0.5 is 0.3085. Therefore the answer is 0.3085. 


$.12.12 Covariance 


Calculate the covariance of A = (2, 2.5, 3.5, 4} and B = (13, 10, 9, 8}. 


Solution 


ША = 3, and ив = 10. Next, we compute a — ил and b — ив for each member of 


A and B, in Table 8.18. 


Finally, we sum (a — ид)(Ь — ив) and divide by 4: 


3-0-0.5 


Соу(А, В)- 


4 


The covariance is negative, therefore A апа В are negatively related, with a value of 


approximately —1.375. 
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Table 8.19 A table showing calculations for Соу(Х, Y) and the standard deviation 


% 10 2 8 4 
y 4 10 5) Т 
х= их 4 -4 Я | = 
Уу = Шү -2 4 -3 1 


(x — их)(у — иу) 8 16 6 2 


(x = их)? 16| 16 4 4 


(у му)? 4|16 |911 


8.12.13 Correlation Coefficient 


Calculate the correlation coefficient for two random variables: X = {10, 2, 8, 4} 
and Y = {4, 10, 3, 7}. 


Solution 
Using Table 8.19 as follows: 


1 —32 
я X — их): — иу) = =" 


The standard deviation of Х: 


1 MEME 
Ox = La — их)? 


The standard deviation of X is approximately 3.16. 
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The standard deviation of Y: 
122 
ха = 3 T 2 
бү = 4 А Qi — Му) 


-j3 


The standard deviation of Y is approximately 2.74. Therefore, the correlation coef- 
ficient is: 


-8 


— — 24 
316x274 ^ 0? 


The correlation coefficient is approximately —0.924. 


6.12.14 Least Squares 


Calculate the equation of a straight line the passes through the points stored in X and 
Y using the least squares algorithm. 


Xen. 5 7} 
— (4, 2, 3, 0.5] 


Solution 
Table 8.20 contains the relevant calculations. 


Table 8.20 Calculations for the covariance and variance of X and Y 


ў 1 3 5 7 

у 4 2 3 0.5 
xX = их -3 —1 1 9 
y— Мү 1.625 0.375 | -0.625 -1.875 


(x — ux) — uy) 4.875 0.375 0.625 | —5.625 


(x — ux) — их) 9 1 1 9 
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Fig. 8.9 А straight line through 4 points using least squares 


их =4 
Мү = 2.375 
Cov(X, Y) = —4.875 — ae 0.625 — 5.625 2 2875 
мшш) ?+1+1+9 _, 
т = 225 = —0.575 


с = 2.375 + 0.575 x 4 = 4.675 
у = —0.575x + 4.675 


Figure 8.9 shows the original 4 points, and the best fit equation, у = —0.575x + 
4.675. 
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Chapter 9 ®) 
Modular Arithmetic реа 


91 Introduction 


This chapter introduces modular arithmetic and its notation. It also shows how modu- 
lar arithmetic is used in practice with worked examples. The author acknowledges the 
following references in researching this chapter (Gullberg 1997; https://www.doc.ic. 
ac.uk/-mrh/330tutor/ch03.html; www.en.wikipedia.org/modular arithmetic; www. 
vocalink.com). 


9.2 Informal Definition 


Modular arithmetic is concerned with integers, especially when one is divided by 
another called a modulus giving a remainder. This results in a cyclic remainder 
sequence for successive integers. For example, Table 9.1 shows the remainders, 
1, 2, 0, 1, 2, 0, 1, 2, 0, 1 when the integers 1 to 10 are divided by the modulus 
3, and Table 9.2 shows the remainders, 1, 2, 3, 0, 1, 2, 3, 0, 1, 2 when dividing 
by 4. 

An every-day example of modular arithmetic is the 12-hour clock where the 
modulus is 12. The 24 hours become the sequence: 


1,2,3,4,5,6, 7, 8, 9, 10, 11,0, 1,2, 3, 4, 5, 6,7, 8, 9, 10, 11,0 


9.3 Notation 


The notation of modular arithmetic is very simple and takes the form: 
a(modn)=r, а «?, reN, neN 


© Springer Nature Switzerland AG 2024 159 
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Table 9.1 Тһе remainder dividing by modulus 3 


mel | 2 | 3 4S | 6 | 7 | 8 97 10 


mem 0 0,11|1 1(|2/|21/2/3 | 3 


Remainder 120120120 1 


Table 9.2 The remainder dividing by modulus 4 


mem 0001222 


Remainder 1123010230 1 2 


where a is some integer, n is the modulus, and r is the remainder. An example being 
23 (mod 5) = 3. 


9.4 Congruence 


In mathematics congruence means ‘similar to’, and in the context of modular arith- 
metic, two numbers are congruent when they have the same remainder with acommon 
modulus. Although this is an informal definition, it remains mathematically correct. 
Formally, congruence is expressed without any mention of remainders as: 

Leta, b € Z,n є №, ћепа and b are congruent modulo п, writtena = b (mod n), 
iff a — b is divisible by n, written n | a — b, and implies that there is an integer k 
such thata — b = k xn. 

There is no mention of remainders, however the definitions are identical as a 
consequence of the definition: a = b (mod n). 

For example, given 18 (mod 4) and 14 (mod 4), then 18 — 14 = 1 x 4 therefore, 
14 = 18 (mod 4). They also both have remainders 2, which is the useful feature of 
modular arithmetic. Remember that the integers a and b can be positive or negative, 
but the modulus n > 0. Here are some further examples: 


3 = 6 (тоа 3) = 0 
23 = 34(mod 11) = 1 
31 = 11 (mod 4) = 3 
35 = 5 (той 6) = 5 
121 = 49 (mod 8) = 1 
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We can also write a = r (mod n) which is stating that an integer a is congruent with 
its remainder r, modulus n. For example, 14 = 4 (mod 5). 


9.5 Negative Integers 


Although modular arithmetic applies to positive and negative integers, the remainder 
is always positive, which means that we need to be careful when dealing with negative 
numbers. The formula associated with modular arithmetic is: 


a=kxn+r (9.1) 


Using (9.1), for any positive number а and modulus п, there will exist a value of k 
such that r < n. For example, when а = 11 and n = 3, then k = 3 and a remainder 
r=2. 

Now when a is negative, k must also take on a negative value, such that r < п, 
and remains positive. For example, Ға = —11 and n = 3, then k = —4: 


—1=—4х3+1 


therefore, —11 (mod 3) = 1. 
Let's take another example where а = —7 and n = 3, Шеп К = —3: 


-7--3х3--2 


therefore, —7 (mod 3) = 2. 

The reason for the above rule is that a continuum must exist between negative and 
positive numbers. Table 9.3 shows an example where n = 3 and a varies between 
—5 and +5. 

One last observation concerning negative numbers; if a = b (mod n) then 
—a = —b (mod n). To illustrate this given 13 = 18 (mod 5) = 3, then —13 = 
—18 (mod 5) = 2. Observe the change in remainder. 


Table 9.3 The continuum between -5 and +5 


Юр 


а (mod 3) 1 2 0 1 2 ова Ро о 71741 2 


162 9 Modular Arithmetic 


9.6 Arithmetic Operations 


In this section we explore some of the relationships between numbers and number 
pairs when sharing the same modulus. 


9.6.1 Sums of Numbers 


Let’s see what happens to the remainders of five numbers when they are summed 
together. Let the numbers be 5, 6, 7, 8, 9 with a sum of 35, and a modulus of 3. 
Therefore: 


5 (mod 3) = 2 
6 (mod 3) = 0 
7 (mod 3) = 1 
8 (mod 3) = 2 
9 (mod 3) = 0 
35 (mod 3) = 2 


It is easy to add the individual numbers and find the remainder from the sum, but 
with much larger numbers, we can sum the individual remainders and subject this to 
the same modular arithmetic. In the case of the above five numbers, the remainders 
sum to2+0+1+2+0=5, and 5 (mod 3) = 2. 

Let’s take two relatively large numbers 3,412 and 5,354 modulo 17. Therefore: 


3,412 (mod 17) = 12 and 5,354 (mod 17) = 16 


the sum of the two numbers is 8,766, and the sum of their two remainders is 
12 + 16 = 28. Therefore, the remainder of 8,766 is 28 (mod 17) = 11. 
The general formula for this observation is: 


ifa+b=c, a(modn)+ b (mod n) = c (mod n) 


Furthermore, if a and b are increased by an integer i, then the following relationship 
holds: 
ifa = b (mod n) then a + i = b + i (mod n) 


For example, if a = 13 and b = 20, then 13 = 20 (mod 7). Therefore, when i = 5, 
then: 
18 = 25 (mod 7) = 4 
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Another relationship concerns pairs of congruent numbers: 


ifa = b (modn) and с = а (mod n) then a + c = b + d (mod n) 


For example, if a = 21, b = 31, c = 18, d = 33, n = 5 then: 


21 = 31 (mod 5) = 1 
18 = 33 (mod 5) = 3 
21 + 18 = 31 + 33 (mod 5) = 4 


9.6.2 Products 


When we find the product a x b = c, then a (mod n) x b (mod n) = c (mod n). 
For example, 12 x 14 = 168, then: 


12 (mod 5) = 2 
14 (mod 5) = 4 
168 (mod 5) = 3 


Therefore, 2 x 4 (mod 5) = 3 (mod 5) = 3. 
Let’s try another example: 11 x 12 = 132, then: 


11 (mod 3) = 2 
12 (mod 3) = 0 
132 (mod 3) = 0 


Therefore, 2 x 0 (mod 3) = 0 (mod 3) = 0. 


9.6.3 Multiplying by a Constant 


Given а = b (mod n), then multiplying a and b by an integer k does not alter the 
congruence relationship: k x a = k x b (mod n). For example: 


23 = 33 (mod 5) = 3 


therefore multiplying 23 and 33 by 2 gives 46 = 66 (mod 5) = 1. Note that the 
remainder changes from 3 to 1, which is determined by 3 x 2 (mod 5) = 1. 
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Let’s try another example: 
11 = 17 (mod 6) = 5 


therefore multiplying 11 and 17 by 3 gives 33 = 51 (mod 6) = 3. This time the 
remainder changes from 5 to 3, which is determined by 5 x 3 (mod 6) = 3. 


9.6.4 Congruent Pairs 


Given a congruent pair а = b (mod n) and c = d (mod n), then a x c =b x 
d (mod n). 
For example: 


3 = 7 (mod 4) = 3 
11 = 15 (mod 4) 23 


therefore: 


3 x 11 =7~x 15 (mod 4) = 1 
33 = 105 (mod 4) = 1 


Note that the remainder has changed from 3 to 1. 


9.6.5 Multiplicative Inverse 


In normal arithmetic, we must avoid accidentally dividing by zero. So too, in modular 
arithmetic, as zeros often arise simply by changing a modulus. The following are not 
permitted: 


2/5 (mod 5) 
3/10 (mod 5) 
5/15 (mod 5) 


etc. 


because 5, 10, 15, etc. are all congruent with 0 (mod 5), and create a divide by zero 
condition. 

To avoid a potential divide by zero, a multiplicative inverse is employed, such 
that division is replaced by multiplication. For example, in normal arithmetic, 10/5 
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can be replaced by a product as follows: 


10/5 2 10 x i 


where 5 x i = 1 


In this case, the answer is easy: i = 0.2, which means that 10/5 = 10 x 0.2. 
In modular arithmetic, the operation: 


a/b (mod n) 


is replaced by: 
a x i (mod n) 


where i is the multiplicative inverse of b, if one exists, such that: 
bxiz1(modn) 

Say we need to compute 4/5 (mod 6), then we require the multiplicative inverse 
of 5 (mod 6). To simplify this search, Table 9.4 shows the products of the numbers 
0 to 5 (mod 6), and is referenced (row, column). The table shows that there are 2 
entries containing 1: (1, 1), (5, 5), highlighted in red. 


The value of i for the divisor 5 (mod 6) is determined as follows: 


4/5 = 4 x i (mod 6) 
and 5 x i = 1 (mod 6) 


Table 9.4 shows that the entry (5, 5) contains a 1, therefore i = 5, and means that: 


4/5 = 4 x 5 (mod 6) 


Table 9.4 Products (mod 6) 


x Pon 5 


о о о о ооо 


о 2345 


0240 24 


3030303 
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Table 9.4 shows that the product 4 x 5 = 2 (mod 6), therefore: 

4/5 = 2 (mod 6) 
Next, say we need to compute 3/2 (mod 6), then we have: 


3/2 = 3 x i (mod 6) 
and 3 x i = 1 (mod 6) 


But the product 3 x i in Table 9.4 does not contain an entry with a 1, therefore the 
operation 3/2 (mod 6) is not permitted. 


9.6.6 Modulo a Prime 


Table 9.5 shows the products for the numbers 0 to 6 (mod 7), where one notices that 
every row, apart from zero, contains the numbers 0 to 6 in different order, and only a 
single 1. This is because 7 is a prime number, and the product of any two numbers, 
apart from zero, will never introduce any congruent results. This results in a theorem 
which states that: 

When the modulus n is a prime number, then it is possible to divide by any 


non-zero number. Consequently, for any b Е (1, 2, 3,..., n — 1}, there is only one 
number i € (1, 2, 3,..., n — 1} such that: 
| 1. 
р хі = 1(тойлп) and b = і (mod n) 


Table 9.5 Products (mod 7) 


x MOSS ОЗА л ЛЄ 


оо о о оо о/о 


оО 2/3 4 5/6 


21012 4 6 MEM 3 5 


0362 5 М4 
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From Table 9.5 we have (1, 1) = (2, 4) = G, 5) = (4, 2) = (5, 3) = (6, 6) = 1, 
therefore: 


Generally, we can write: 


bxi=tI1(modn) and 5 =a x i (mod n) 
Therefore, using modulo 7: 
ыз аж ed - 2867 ый 
1 2 3 4 5 6 


Table 9.6 shows the products that equal | for a modulus between 2 and 11. The 
table shows that when the modulus лп is a prime number, i.e. 2, 3, 5, 7 and 11, there 
are п — 1 products. These are highlighted in yellow. 

For completeness, I have included the following product tables: Table 9.7 
modulo 2, Table 9.8 modulo 3, Table 9.9 modulo 4, Table 9.10 modulo 5, Table 9.11 
modulo 9, Table 9.12 modulo 10 and Table 9.13 modulo 11. 


Table 9.6 Product pairs that equal 1 with different modulo 


Mod i DES 3x 4x 5x 6x js 8x 9x 10 x 


2 LEN 


3 ix? | 2x2 


4 1х1 3х3 


E 1х1 2х3 | 3х2 4х4 


8 1x1 3903 5x3 To 
9 1x1 2x5 4х7 5x2 7х4 | 8х8 
10 1х1 3x7 7x3 9x9 


11 1х1 2x6 | 3x4 4х3 5x9 | 6x2 7х8 | 8x7 | 9x85 10 x 10 


168 


Table 9.7 Products (mod 2) 


х М 
ооо 
1 о 1 


Table 9.8 Products (mod 3) 


х 1 2 
о о о 0 
110 MI 2 
2200 2 gl 


Table 9.9 Products (mod 4) 


x МР 
о о о о о 
110 2 3 
ЖИ 02 0 |2 
зо з 2 agg 


Table 9.10 Products (mod 5) 


x MOS За 
о о о о о 0 
1100 И2 3 4 
21012 4 EM 3 
31013 NM 4 2 
#101432 
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9.6 Arithmetic Operations 


Table 9.11 Products (mod 9) 


x ПОЗ 
о о о о о 0 
о 23 4 
0124 6 8 
Б 0 3 оо 3 
014 8 3 7 
90 5 6 2 
60630 6 
UNO 7 53 gg 
EM о в 7 6 5 


Table 9.12 Products (mod 10) 


x МЫ 
о о о о оо 
оО м2 3 4 
024 6 8 
0369 2 
ШІ 0.48 2 6 
5 O0 5 0|$5 0 
606218 4 
uo 7 4M 8 
80864 2 
0987 6 


169 
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Table 9.13 Products (mod 11) 


x | 0| 1 2 3 4 5 6 7 8 9 10 
о 0 о 0 0 0 0 0 0 0 0 0 
11011 2) 3 4 5 6 7 8 9 10 
2 O | 2 4 6 8110 1 3 5 7 9 
з 0 | 3 6 9 1 4 ГІШ! 2 5 8 
4/0) 4 8 1 5 9 2 ОЭ 10835 7 
5 015 | 10) 4 2 3 8 2 y 1 6 
6 о | 6 1 7 2 8 3 9 4 10 5 
7 о 7 3 10/6 2 9 5 [| 8 4 
8 |0 | 8 5 2 | 10) 7 4 1 9 6 3 
2m 0 9 ТІ 5 3 1 10 8 6 4 2 
00 10 9 8 7 6 5 4 3 2) 1 


9.6.7 Fermat's Little Theorem 


The French lawyer and mathematician Pierre de Fermat (1601—1665), wrote to a 
friend in 1640 that he had discovered the following connection: If p is a prime 
number, then for any integer a, the number а? — a is an integer multiple of p. In 
modular arithmetic this is written: 


а? =a(mod р), ає 2, pisa prime number 


Table 9.14 shows some examples of Fermat's Little Theorem. 


9.7 Applications of Modular Arithmetic 


9.7.1 ISBN Parity Check 


When an ISBN (International Standard Book Number) is assigned to a text-based 
monographic publication, it now always consists of 13 digits. Each ISBN consists 
of 5 elements with each section being separated by spaces or hyphens. Three of the 
five elements may be of varying length. 
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Table 9.14 Examples of Fermat’s Little Theorem 


а(р а? aP—a a? (mod p) | a (mod p) 
|2 4 2 0 0 
2 |75 8 6 2) 2 
2 | 8 32 30 2 2 
22 128 126 2 2) 
| 2 9 6 1 1 
3) || 3 27 24 0 0 
3 | 5 243 240 3 3 
5 || 7 2,187 2,184 3 3 
з |7 25 20 1 1 
5 | 3 125 120 2) 2 
SA ES 3,125 3,120 0 0 
5 | 7 | 78,125 78,120 5 5 
93 |2 49 42 1 1 
3 | 3 343 336 1 1 
ENS: 16,807 16,800 2) 2 
7 | 7 | 823,543 | 823,536 0 0 


For example, the ISBN for the first edition of Foundation Mathematics for Com- 
puter Science is 978-3-319-21436-8. The first 3 digits can either be 978 or 979; 3 
identifies the country, geographical region, or language area; 319 identifies the par- 
ticular publisher or imprint; 21436 identifies the particular edition and format of a 
specific title; 8 is a check digit that validates the rest of the number using modulo 10 
with alternate weights of 1 and 3. If the remainder isn’t zero, it is subtracted from 
10, giving the check digit: 


[9x )--(7х3)-(8х 10) + (3 х3) + (3х1) + (1х 3) 

+ (9 х 1) + (2 x 3) + (1 x 1) + (4 x 3) + (3 x 1) (6 х 3)] (mod 10) 
=94+214+84+943434+9+6+1+12+3+ 18 (mod 10) 

= 102 (mod 10) 

=2 


As 2 isn’t zero, it is subtracted from 10 giving the check digit 8, as used in the ISBN. 
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9.7.2 IBAN Check Digits 


IBAN (International Bank Account Number) validation through control digits is used 
as an effective way of reducing failed transactions when processing international 
and domestic payments. Three types of modulus checks are performed on UK bank 
accounts: 


Mod 10-Standard 10 modulus check 
Mod 11-Standard 11 modulus check 
DBIAI—Double alternate modulus check 


Table 9.15 shows the notation used to define the specific digits within sort codes 
and account numbers. 

For the standard (10 and 11) modulus check process, each digit of the combined 
sort code and account number is multiplied by a weight. These values are summed 
and divided by the specified modulus. If the result has no remainder, the sort code 
and account number are valid. For example, using the sort code and account number: 
000000 58177632, Table 9.16 shows the calculation, where each digit of the IBAN 
number is multiplied by its weight and summed. This results in: 


0--0--0--0--0--0--35--40--8--21--28--36--6--2- 176 


The IBAN system specifies that the sum 176 is working with a modulus of 11, which 
gives a remainder of zero, and validates the sort code and account number. 


Table 9.15 IBAN Notation 


Sort code Account number 


Dict 1121345061 21345 6 78 


Weight |u|v|w|x y|z|a bj|c|d|e f|g|h 


Table 9.16 IBAN Standard modulus check process 


Sort code Account number 
Digit оо о ооо 5 8 ОЕ 7 б з? 
Weight оо ооо 7 50158-1153 4 6 1211 
Weighted 41215100 0 0,0 0 35 40 8 21 28 36 6 2 
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Table 9.17 IBAN Double alternate modulus check process 


Sort code Account number 
Digit 4 |9 D 2 7 sy | al || ae |) sy || zh 5 6 7 8 
Weight ІІ 2 1 2 ПІРІ” il 2 1 2 1 
Weighted digits | 8 | 9 | 1812043202064 10} 6 Г 14 |8 


For the double alternate modulus check process, each digit of the combined sort 
code and account number is multiplied by an alternating weight of 2 and 1. This 
time the single digits are summed and divided by 10. If the result has no remainder, 
the sort code and account number are valid. For example, using the sort code and 
account number: 499273 12345678, Table 9.17 shows the calculation, where each 
digit of the IBAN number is multiplied by its weight and summed. This results in: 


8+9+1+8+2+1+4+3+2+2+6+4+1+0+6+1+4+8= 70 


The IBAN system specifies that the sum 70 is working with a modulus of 10, which 
gives a remainder of zero, and validates the sort code and account number. 

The weights and modulus are stored in a large table and vary according to the sort 
code range. For example, for sort codes between 090190 and 090196, the modulus 
is 10 and the weights аге 0, 0, 3, 7, 1, 3, 7, 1, 3, 7, 1, 3, 7, 1. Whereas for sort codes 
between 090720 and 090726, the modulus is 11 and the weights are 0, 0, 0, 0, 0, 0, 
9, 8, 7, 6, 5, 4, 3, 2, 1. 

(www.vocalink.com) contains a complete description of the modulus-based 
checking system. 


9.8 Worked Examples 


9.8.1 Negative Numbers 


Calculate the remainders of the following negative numbers: 


—20 (mod 4), - 13 (mod 7), —6 (mod 9), —23 (mod 5) 
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Solution 
Use the the formula a = k x n +r: 


—20 (mod 4) = —5 x 4+0 = 0 
—13 (mod 7) = -2x 74-121 
—6 (mod 9) = —1 x 9+3 = 3 
—23 (mod 5) = —5 x 5+2 = 2 


9.8.2 Sums of Numbers 


Calculate the remainder of the following summations using the individual remain- 
ders: 


23+5+19+41 +29+ 2 + 19 + 44 (mod 7) 
123 + 345 + 678 + 910 (mod 11) 


Solution 
Find the remainder of the sum the individual remainders: 


23 (mod 7) = 2 

5(mod 7) = 5 
19 (mod 7) = 5 
41 (mod 7) = 6 
29 (mod 7) = 1 

2 (mod 7) = 2 
19 (mod 7) = 5 
49 (mod 7) = 0 


2+5+5+6+1+2 + 5) (mod 7) = 5 


123 (mod 11) = 2 
345 (mod 11) = 4 
678 (mod 11) = 7 
910 (mod 11) = 8 
(2+4+7-+ 8) (mod 11) = 10 
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9.8.3 Remainders of Products 


Calculate the remainders of the following products: 
(123, 456 x 789, 101) (mod 11) 
(101, 101 х 202, 202) (mod 13) 


Solution 
Find the remainder of the product of the individual remainders: 


123, 456 (mod 11) = 3 
789, 101 (mod 11) = 5 
(3 x 5) (mod 11) = 4 


101, 102 (mod 13) = 1 


202, 203 (mod 13) = 1 
(1 x 1) (mod 13) = 1 


9.8.4 Multiplicative Inverse 


Using Table 9.5 find the multiplicative inverse of the following using modulo 7: 


Solution 
Look for entries in Table 9.5 containing а 1, otherwise there is no multiplicative 


inverse: 


: 4 x 2 (mod 7) = 1, therefore 1 x 2 (mod 7) = 2 


Ale 


: 4 x 2 (mod 7) = 1, therefore — = 2 x 2 (mod 7) = 4 


vlww AIN Ble 


: 5 x 3 (mod 7) = 1, therefore - = 3 x 3 (mod 7) = 2 


шоро мо AIN 


5 
:3 x 5 (mod 7) = 1, therefore 3 = § x 5 (mod 7) = 4 
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9.8.5 Product Table for Modulo 13 


Construct the product table for modulo 13 and show the entries that permit a 
multiplicative inverse. 


Solution 


Table 9.18 shows the products for modulo 13 with the entries equal to 1, highlighted 
in red. 


9.8.6 ISBN Check Digit 


The author's book Imaginary Mathematics for Computer Science has the ISBN 978- 
3-319-94636-8 (Vince 2018). Using the algorithm described above, confirm the value 
of the check digit. 


Table 9.18 Products (mod 13) 


x М 2 3 4 5 6 7 8 О ЛО о 


0 0 0 0 0 0 0 0 0 0 0 0 


1 2) 3 4 5 6 7 8 “% оу 


2) 4 6 8 10 | 12 | 1 3 5 ТІ 91/1 


5 10 2 7 12 | 4 9 1 6 iil 3 8 
ІШІ 11 |; 4 | 10) 3 9 2 8 1 1 


Т) 1 8 2 9 31004 1 5 12/6 


11 ШЕ © 7 5 3 1 12110 8 6 4 2 


(е 
е) (€) 5 9 ы ә ы с в с € € ә 
о 


12 ІШІ ШИ || 10| % 8 7 6 5 4 3 2) 1 
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Solution 


[9 x D) (7х3) + (8x 0+ (3 х3) + (3х) +(1х 3) 

+ (9 x D) (9х 3) + (4 x 1D) + (6 x 3) + (3 x D) + (6 x 3)] (mod 10) 
= [9+21+8+9+3 4-3 49 4-27 4- 4 4-18 + 3 + 18] (mod 10) 

— 132 (mod 10) 

=2 


As 2 isn’t zero, it is subtracted from 10 giving the check digit 8, as used in the ISBN. 
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Chapter 10 A) 
Trigonometry cheret 


10.1 Introduction 


This chapter covers some basic features of trigonometry such as angular measure, 
trigonometric ratios, inverse ratios, trigonometric identities and various rules, with 
which the reader should be familiar. 


10.2 Background 


The word trigonometry divides into three parts: ‘tri’, ‘gon’, ‘metry’, which means 
the measurement of three-sided polygons, i.e. triangles. It is an ancient subject and 
is used across all branches of mathematics. 


10.3 Units of Angular Measurement 


The measurement of angles is at the heart of trigonometry, and today two units 
of angular measurement have survived into modern usage: degrees and radians. 
The degree (or sexagesimal) unit of measure derives from defining one complete 
rotation as 360°. Each degree divides into 60 min, and each minute divides into 60s. 
The number 60 has survived from Mesopotamian days and is rather incongruous 
when used alongside today’s decimal system. The radian, however, is natural to 
mathematics. 

The radian of angular measure does not depend upon any arbitrary constant—it 
is the angle created by a circular arc whose length is equal to the circle’s radius. 
And because the perimeter of a circle is 2лг, 2x radians correspond to one com- 
plete rotation. As 360° correspond to 2л radians, | radian equals 180°/z, which is 
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approximately 57.3°. The following relationships between radians and degrees are 
worth remembering: 


2 rad = 90° л rad = 180° 
3л 
F rad = 270° 2л rad = 360° 
lrad ~ 57.29578° 1° ~ 0.017 4533 rad 
To convert x° to radians: > 
TX 
— rad 
180 
To convert x rad to degrees: 
180x° 
л 


10.4 Тһе Trigonometric Ratios 


Ancient civilisations knew that triangles—whatever their size—possessed some 
inherent properties, especially the ratios of sides and their associated angles. This 
means that if these trigonometric ratios are known in advance, problems involving 
triangles with unknown lengths and angles, can be discovered using these ratios. 

Figure 10.1 shows a point P with coordinates (base, height), on a unit-radius 
circle rotated through an angle 0. As P is rotated, it moves into the 2nd quadrant, 3rd 
quadrant, 4th quadrant and returns back to the first quadrant. During the rotation, the 
sign of height and base change as follows: 


Fig. 10.1 The four 


quadrants for the jg rodun 3 1 d 
trigonometric ratios 2nd quadrant Ж й М, асап 
rd 
РА \ 
j = 
/ height 


base 
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Ist quadrant: height(+), base(+) 
2nd quadrant: height(+), base(—) 
3rd quadrant: height(—), base(—) 
4th quadrant: height(—), base(+) 


Figures 10.2 and 10.3 plot the changing values of height and base over the four 
quadrants, respectively. 

When radius = 1, the curves vary between | and —1. In the context of triangles, 
the sides are labelled as follows: 


hypotenuse = radius 
opposite = height 


adjacent = base 


Thus, using the right-angle triangle shown in Fig. 10.4, the trigonometric ratios: 
sine, cosine and tangent are defined as: 


. opposite adjacent opposite 
5100 = ———————, с050 = ———————, tanl = ———— 
hypotenuse hypotenuse adjacent 
Fig. 10.2 The graph of laste 
height over the four 
quadrants 
height 1st 2nd 
quadrant quadrant 
3rd Ath 
quadrant quadrant 
Fig. 10.3 The graph of base E LL 


over the four quadrants 


base Ath 


quadrant 


Ist 
quadrant 


2nd 3rd 
quadrant | quadrant 
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Fig. 10.4 Sides of a 
right-angle triangle 


hypotenuse 
opposite 


adjacent 


The reciprocals of these functions, cosecant, secant and cotangent are also useful: 


1 1 
csc = ——, зесд= , cotü = 
sin 0 cos 9 tan 0 


As an example, Fig. 10.5 shows a triangle where the hypotenuse and an angle are 
known. The other sides are calculated as follows: 


opposite _ sin 40° 
10 
opposite = 10 sin 40° = 10 х 0.64278 = 6.4278 
di 
SEHEN, aed 
10 


adjacent = 10 cos 40° © 10 x 0.7660 = 7.660 
The theorem of Pythagoras confirms that these lengths are correct: 
6.4278? + 7.660? ~ 10? 
Figure 10.6 shows the graph of the tangent function, which, like the sine and cosine 


functions, is periodic, but with only a period of л radians. 


Fig. 10.5 A right-angle 
triangle with two unknown 
sides 


opposite 


40° 


adjacent 
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Fig. 10.6 Graph of the 
tangent function 


10.4.1 Domains and Ranges 


The periodic nature of sin 0, cos 0 and tan 0, means that their domains are infinitely 
large. Consequently, it is customary to confine the domain of sin 0 to: 


EH 


(0, л] 


and cos 0 to: 


The range for both sin Ө and cos 0 is: 
[—1, 1] 
The domain for tan 0 is the open interval: 
l-> al 
2” 22 
and its range is the open interval: 


]—оо, oo[ 


10.5 Inverse Trigonometric Functions 


The functions sin 0, cos 6, tan 0, csc 0, sec 0 and cot 0 provide different ratios for the 

angle 9, and the inverse trigonometric functions convert a ratio back into an angle. 

These are arcsin, arccos, arctan, arccsc, arcsec and arccot, and are sometimes written 
‚1—1 1 1 

as sin 


, cos, tan^!, csc~!, вес”! and cot~!. For example, sin 30° = 0.5, therefore, 
arcsin(0.5) = 30°. Consequently, the domain for arcsin is the range for sin: 
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[-1, 1] 
and the range for arcsin is the domain for sin: 
EH 
2'2 
as shown in Fig. 10.7. Similarly, the domain for arccos is the range for cos: 
[—1, 1] 
and the range for arccos is the domain for cos: 
(0, л] 


as shown in Fig. 10.8. 
The domain for arctan is the range for tan: 


1- оо, oo[ 


Fig. 10.7 Graph of the arcsin x A 
arcsin function nid | =2===255 


— -л/2 


Fig. 10.8 Graph of the arccos ХА 
arccos function 
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Fig. 10.9 Graph of the arctan x 
arctan function US b enciiceetees 


and the range for arctan is the domain for tan: 


л л 
| 2” 2 | 
as shown іп Fig. 10.9. 

Various programming languages include the atan2 function, which is an arctan 
function with two arguments: atan2(y, x). The signs of x and y provide sufficient 
information to locate the quadrant containing the angle, and gives the atan2 function 
a range of [0, 277]. 


10.6 Trigonometric Identities 


The sin and cos curves are identical, apart from being displaced by 90°, and are 
related by: 
cos 9 = sin(0 + 2/2) 


Also, simple algebra and the theorem of Pythagoras can be used to derive other 
trigonometric identities such as: 


sin Ө 
(ап 0 


cos Ө = 

sin? 6 + cos? 0 = 1 
1 + tan? 0 = sec? 0 
1 cot? 6 = csc? 0 
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10.7 The Sine Rule 


Figure 10.10 shows a triangle labeled such that side a is opposite angle A, side b is 
opposite angle B, etc. The sine rule is: 


а _ b _ с 
sinA  sinB sinC 


which can be used to compute the length of an unknown length or angle. For example, 
if A = 60°, B = 40°, C = 80°, and b = 10, then: 


a 10 


sin60° сіп 40° 


rearranging, we have: 
10 sin 60° 
а----- 


sin 40° 


А 13.47 


Similarly: 
с 10 


sin 80° sin 40° 


therefore: 
Е 10 sin 80° 


sin 40° 


А 15.32 


10.8 The Cosine Rule 


The cosine rule expresses the sin? 0 + соз? Ө = 1 identity for the arbitrary triangle 
shown in Fig. 10.10. In fact, there are three versions: 


Fig. 10.10 А triangle 
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а? =? + с? —2bccosA 
2 = с? ya — 2са cos В 


c? = а? +b’ —2abcos С 
Three further relationships also hold: 


а = b cos C + c cos В 
b = c cos А + a cos C 
c = acos В + bcos А 


10.9 Compound-Angle Identities 


Compound angle identities are useful for solving various mathematical problems, 
but apart from this, their proof often contains a strategy that can be used else where. 
In the first example, watch out for the technique of multiplying by 1 in the form of 
a ratio, and swapping denominators. The technique is rather elegant and suggests 
that the result was known in advance, which probably was the case. Let’s begin by 
finding a way of representing ѕіп(о + В) in terms of sino, cosa, sin В, cos В. 
With reference to Fig. 10.11: 
| FD BC+ED 
in nr PT ap 
BCAC ED CD 
X ADAC ADCD 
BCAC ED CD 
“АСАР CDAD 
sin(a + В) = sina cos В + cosa sin В (10.1) 


To find ѕіп(о — £), reverse the sign of В in (10.1): 
sin(a — В) = sina cos В — cosa sin В (10.2) 
Now let's expand соѕ(о + В) with reference to Fig. 10.11: 


(hd AE АВ- ЕС 
COS ЕБЕ 
S AD AD 
_ АВАС ЕС CD 
" ADAC АРСР 
_ АВАС EC CD 
"X ACAD CDAD 


cos(a + В) = cosa cos В — sina sin В (10.3) 
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Fig. 10.11 Тһе geometry to 
expand ѕіп(о + В) 


To find соѕ(о — В), reverse the sign of В in (10.3): 
cos(a — В) = cosa cos В + sino sin В 
To expand tan(o + В), divide (10.1) by (10.3): 


sin(a + В) sina cos В + cosa sin В 
cos(a+f) совосов В — sina sin В 
sinacosB  cosasinB 


cosacosB совосов В 
cosa cos В sin o sin В 


cosæ cos cosg cos В 
tano + tan В 


t ааш 
а 1 — tano tan В 


(10.4) 


To find tan(a — В), reverse the sign of В in (10.4): 


tana — tan f 


t E 
aoe) 1 + tano tan В 


10.9.1 Double-Angle Identities 


By making В = a, the three compound-angle identities: 


sin(a + В) = sina cos В + cosa sin В 

cos(a + В) = cosa cos В F sina sin В 
tana + tan В 

1 x tano tan В 


tan(a + В) = 
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and provide the starting point for deriving three corresponding double-angle 
identities: 


sin(a + о) = sino cos «œ + cos g sino 


sin(2a) = 2 sin а cos œ 


Similarly: 


соѕ(о + а) = cosa cosa == sina sina 


? æ — sin? a 


соѕ(20) = cos 
which can be further simplified using sin? о + cos? о = 1: 


cos(2a) = cos? а — sin? а 

cos(2a) = 2cos*a — 1 

соѕ (20) = 1 — 2 sin? a 
And for tan(2a), we have: 


tan œ + tano 


tan(a +a) = —— — ——— 
( ) 1 — tano tana 
ina) 2 tana 
ап(2о)------Ғ 
1 — tan? a 


10.9.2 Multiple-angle Identities 


In Chap. 16, Euler’s trigonometric formula shows how the following multiple-angle 
identities are computed: 


sina) = 3sina — 4 sin? о 
cos(3a) = 4 cos? a — 3 cosg 


3 tana — tan? a 
1 — 3 tan? а 


sin(4a) = 4 sin œ cosa — 8 sin? æ cos g 


tan(3a) = 


cos(4a) = 8 costa — 8 cos? œ + 1 
4tana — 4 tan? a 

1 — 6 tan? œ + tan^ o 

біп(5а) = 16 sin? o — 20sin? a + 5sing 


tan(4a) = 
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сов(5а) = 16 cos? a — 20 cos? æ + 5cosa 
5tano — lOtan? о + tan? a 


tan(Sa) = 
коа) 1 — 10tan? «œ + 5tan^ a 


10.9.3 Half-Angle Identities 


Every now and then, it is necessary to compute the half-angle identities from the 
corresponding whole-angle functions. To do this, we rearrange the double-angle 
identities as follows: 


соѕ(20) = 1 — 2sin? о 


= 1 — соѕ(20) 
sin^ à = —————— 


2 
22 1- cosg 
sin“ (0/2) = 
2 
| 1—cosa 
sin(a/2) = + 5 (10.5) 


Similarly: 
22 1 + соѕ(20) 


cos’ o 
2, 


1 
сов2(0/2) = EM 


14-coso 
cos(a/2) = +,/ m TER (10.6) 


Dividing (10.5) by (10.6) we have: 
tàn(a/2) 1- cos g 
an(a/2) = ,/ — —— 
1 + cosg 


10.10 Perimeter Relationships 


Finally, with reference to Fig. 10.10, we come to the perimeter relationships that 
integrate angles with the perimeter of a triangle: 


10.11 Worked Examples 


1 
5 = 24%%-<) 


sin(A/2) = [6 — bXs — с) 
bc 

sin(B/2) — [G— o)— а) 
ca 

sin(C /2) = [6 — а)(5 — b) 
ab 


cos(A/2) — m 0) 
C 

cos(B/2) — ет» 

cos(C/2) = = g 


sinA — „тт a)(s — b)(s — с) 
bc 


sin B — Е a)(s — b)(s — с) 
ca 


sin С = ет a)(s — b)(s — c) 
ab 


10.11 Worked Examples 


10.11.1 Degrees to Radians 


Convert 20? to radians. 


Solution 


_ 20хл 
2222180 
= 0.349066 rad 


10.11.2 Sine Rule 


Find the angles o and В using the sine rule in Fig. 10.12. 


191 
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Fig. 10.12 А right-angled 
triangle 


Solution 
Using the sine rule: 


AE. 
sina віп 90° 
"E. 
sina = = 
а ғ 36.870? 
4 5 
sing 5їп90° 
"T. 
sin B — 5 
В ~ 53.130° 


10.11.3 Cosine Rule 


Find the angles о and В using the cosine rule in Fig. 10.13. 


Fig. 10.13 А right-angled 
triangle 


10.11 Worked Examples 


Solution 
Using the cosine rule: 


52 = 132 + 12? — 2 х 18 x 12 cosa 


25 = 169 + 144 — 312 cosa 
—288 
cosa = ——— 
—312 
a А 22.620° 


122 = 5? + 13? 2 х 5 x 13 cos В 
144 = 25 + 169 — 130cos B 


—50 
cos B = 1130 
В = 67.380° 


10.11.4 Compound Angle 


Give an example of sin(a + В) = sina cos В + cosa sin В. 


Solution 
Let a = 30°, В = 60°, then: 


sin(a + В) = sina cos В + cosa sin В 
sin(30° + 60°) = sin 30? cos 60° + cos 30° sin 60° 
КЕКЕ 
1= 0.5 x 0.5 + — = 
x + 55 
1 = 0.25 + 0.75 


10.11.5 Double-Angle Identity 


2 2 


Give an example of cos(2a) = cos“ a — sin’ о. 


Solution 
Let a = 30°: 


193 


194 


2 2 


соѕ(20) = cos“ a — sin^ а 


cos(2 x 30°) = cos? 30° — sin? 30° 


3 1 

6° =>- - 

COS 4 4 
1_1 
7-2 


10.11.6 Perimeter Relationship 


: : __ (s—b)(s—c) 
Give an example of sin(A/2) = y =g ~~. 


Solution 
Using Fig. 10.12 where a = 5, b=3, с= 4: 


== (a b o 
1 
= 09-354) 
=6 
sin(A/2) = |65 — bs — с) 
bc 
. rn Toss MB) 
sin(90 /2)= ЕГЕТЕ 


in45° = |/ E 
sin 45° = ,/ — 
12 


v2 42 
2 


2 
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Chapter 11 A) 
Coordinate Systems geu 


11.1 Introduction 


This chapter revises coordinate systems which include Cartesian coordinates, polar, 
spherical polar, cylindrical and barycentric coordinate systems. It also includes the 
distance between two points in space, and the area of simple 2D shapes. It concludes 
with a collection of worked examples. 


11.2 Background 


René Descartes is often credited with the invention of the xy-plane, but Pierre de 
Fermat was probably the first inventor. In 1636 Fermat was working on a treatise 
titled Ad locus planos et solidos isagoge, which outlined what we now call ‘analytic 
geometry’. Unfortunately, Fermat never published his treatise, although he shared 
his ideas with other mathematicians such as Blaise Pascal (1623—1662). At the same 
time, Descartes devised his own system of analytic geometry and in 1637 published 
his results in the prestigious journal Géométrie. In the eyes of the scientific world, 
the publication date of a technical paper determines when a new idea or invention is 
released into the public domain. Consequently, ever since this publication Descartes 
has been associated with the xy-plane, which is why it is called the Cartesian plane. 

The Cartesian plane is such a simple idea that it is strange that it took so long 
to be discovered. However, although it is true that René Descartes showed how 
an orthogonal coordinate system could be used for graphs and coordinate geometry, 
coordinates had been used by ancient Egyptians, almost 2000 years earlier! If Fermat 
had been more efficient in publishing his research results, the x y-plane could have 
been called the Fermatian plane (Boyer and Merzbach 2011)! 
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Fig. 11.1 Тһе Cartesian УА 
plane 3 


КЕР) 


xy 


(41-2) 


11.3 The Cartesian Plane 


The Cartesian plane provides a mechanism for locating points with a unique, ordered 
pair of numbers (x, y) as shown in Fig. 11.1, where P has coordinates (3, 2) and 
Q has coordinates (-4, —2). The point (0, 0) is called the origin. As previously 
mentioned, Descartes suggested that the letters x and y should be used to represent 
variables, and letters at the other end of the alphabet should stand for numbers. Which 
is why equations such as y = ах? + bx + c, are written this way. 

The axes are said to be oriented as the x-axis rotates anticlockwise towards the 
y-axis. They could have been oriented in the opposite sense, with the y-axis rotating 
anticlockwise towards the x-axis. 


11.44 Function Graphs 


When functions such as: 


linear: y = mx + с 
quadratic: у = ах? + bx + c 
cubic: у = ax? + bx? +cx +d 
trigonometric: y = a sin x 


are drawn as graphs, they create familiar shapes that permit the function to be easily 
identified. Linear functions are straight lines; quadratics are parabolas; cubics have 
an ‘S’ shape; and trigonometric functions often possess a wave-like trace. Figure 
11.2 shows examples of each type of function. 
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Fig. 11.2 Graphs of four F 
function types | | 


quadratic 


xy 


A => 


1пиѕоіа 


122] 


11.5 Shape Representation 


The Cartesian plane also provides a way to represent 2D shapes numerically, which 
permits them to be manipulated mathematically. Let’s begin with 2D polygons and 
show how their internal area can be calculated. 


11.5.1 2D Polygons 


A 2D polygon is formed from a chain of vertices (points) as shown in Fig. 11.3. A 
straight line is assumed to connect each pair of neighbouring vertices; intermediate 
points on the line are not explicitly stored. There is no convention for starting a chain 
of vertices, but software will often dictate whether polygons have a clockwise or 
anticlockwise vertex sequence. 

We can now subject this list of coordinates to a variety of arithmetic and mathe- 
matical operations. For example, if we double the values of x and y and redraw the 
vertices, we discover that the shape’s geometric integrity is preserved, but its size 


Fig. 11.3 A polygon created 


Ма 


from а chain of vertices ^ (1, 3) 
(3, 2) 
2 
и (LI (3, 1) 
i 2 3 4 5% 
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is doubled relative to the origin. Similarly, if we divide the values of x and y by 2, 
the shape is still preserved, but its size is halved relative to the origin. On the other 
hand, if we add 1 to every x-coordinate, and 2 to every y-coordinate, and redraw the 
vertices, the shape's size remains the same but is displaced 1 unit horizontally and 2 
units vertically. 


11.5.2 Areas of a Polygonal Shapes 


The area of a polygonal shape is readily calculated from its list of coordinates. For 
example, using the list of coordinates shown in Table 11.1, the area is computed by: 


1 
area — 2 1007! = xiyo) + Gi ya — xay1) + (X23 - X32) + (xayo — xoya)] 


You will observe that the calculation sums the results of multiplying an x by the 
next y, minus the next x by the previous y. When the last vertex is selected, it is 
paired with the first vertex to complete the process. The result is then halved to reveal 
the area. As a simple test, let’s apply this formula to the shape described in Fig. 11.3: 


1 
аға-201х1-3х1)4(3х2-3х1)4(3х3-1х2)5(1х1-1х3) 
1 
area = а = 3 
which, by inspection, is the true area. The beauty of this technique is that it works 
with any number of vertices and any arbitrary shape. 
Another feature of the technique is that if the set of coordinates is clockwise, 
the area is negative, which means that the calculation computes vertex orientation as 


well as area. To illustrate this feature, the original vertices are reversed to a clockwise 
sequence as follows: 


1 
area = А хз-1х1+(1х2-3х3) + 3х1 3х2) + Gx 1- 1x D] 


1 
агеа = 2Е-7-9421 =-3 
The minus sign confirms that the vertices are in a clockwise sequence. 


Table 11.1 А polygon’s coordinates 


goo 
Xo Yo 
от 
x2 | у 
X3 ya 
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Fig. 11.4 Calculating the y 
distance between two points 


му 


11.5.3 Theorem of Pythagoras т 2D 


The theorem of Pythagoras is used to calculate the distance between two points. 
Figure 11.4 shows two arbitrary points Р(х, уі) and P2(x2, y2). The distance 
Ax = x? — xı and Ay = y» — yı. Therefore, the distance d between P, and Р» is 


given by: 
d = (Ax)? + (ду)? 


For example, given Pi(1, 1), P»(4, 5), then d = v3? + 42 = 5. 


11.5.4 Pythagorean Triples 


A Pythagorean triple states the theorem of Pythagoras using integers. For example, 
these are the first few: 


3.425, 9+ 16 = 25 

62 + 82 = 10°, 36 + 64 = 100 
92 + 122 = 152, 81+ 144 = 225 
5 + 12? = 13", 25 + 144 = 169 
7? + 24? = 25°, 49 + 576 = 625 


Pierre de Fermat proposed that there is no integer solution to a” + b” = с", with 
n > 2, but he did not provide a solution. The English mathematician Sir Andrew 
John Wiles (1953-), provided a solution in 1994, what is known as Fermat's Last 
Theorem. 
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Fig. 11.5 a A left-handed y 
system. b A right-handed 
system 

X 

(b) 
2 x 
(a) 
x 2 


11.6 3D Cartesian Coordinates 


Two Cartesian coordinates are required to locate a point on the 2D Cartesian plane, 
and three Cartesian coordinates are required for 3D space. The corresponding axial 
system requires three mutually perpendicular axes; however, there are two ways to 
add the extra z-axis. Figure 11.5 shows the two orientations, which are described as 
left- and right-handed axial systems. The left-handed system permits us to align 
our left hand with the axes such that the thumb aligns with the x-axis, the first finger 
aligns with the y-axis, and the middle finger aligns with the z-axis. The right-handed 
system permits the same system of alignment, but using our right hand. The choice 
between these axial systems is arbitrary, but one should be aware of the system 
employed by commercial computer graphics packages. The main problem arises 
when projecting 3D points onto a 2D plane, which has an oriented axial system. A 
right-handed system is employed throughout this book, as shown in Fig. 11.6, which 
also shows a point P with its coordinates. It also worth noting that handedness has 
no meaning in spaces with 4 dimensions or more. 


Fig. 11.6 А right-handed 
axial system showing the 
coordinates of a point P 
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11.6.1 Theorem of Pythagoras іп 3D 


The theorem of Pythagoras in 3D is a natural extension of the 2D rule. In fact, 
it even works in higher dimensions. Given two arbitrary points Р(х, уі, 21) and 
Р(х, yo, 22), We compute Ax = x2 — хи, Ay = y» — yı and Az = 52 — zi, from 
which the distance d between Р, and P; is given by: 


d = // (Ах)? + (Ay)? + (Az? 


and the distance from the origin to a point P(x, y, z) is simply: 


d= /х2 + у2 + 2 


Therefore, the point (3, 4, 5) is М3? + 42 + 52 ~ 7.07 from the origin. 


11.7 Polar Coordinates 


Polar coordinates are used for handling data containing angles, rather than linear 
offsets. Figure 11.7 shows the convention used for 2D polar coordinates, where the 
point P(x, y) has equivalent polar coordinates Р(р, 0), where: 

x = рсов0 

y = psinO 

р = Ух? + у? 


9 = arctan(y/x) 


For example, the point О (4, 0.87) in Fig. 11.7 has Cartesian coordinates: 


Fig. 11.7 2D polar У 
coordinates Q(4, 0.87) P(p, 0) = P(x, y) 
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Fig. 11.8 Nine 
Archimedean spirals 


x = 4с05(0.8л) + —3.24 
у = 4в5іп(0.8л) А 2.35 


and the point (3, 4) has polar coordinates: 


о =у3?+42 = 5 


Ө = arctan(4/3) = 53.13? 


These conversion formulae work only for the first quadrant. The atan2 function 
should be used in a software environment, as it works with all four quadrants. 
Figure 11.8 shows nine Archimedean spirals using polar coordinates, created 
by: 
p — k0, k=2, 4, 6, 8, 10, 12, 14, 16, 18 


11.8 Spherical Polar Coordinates 


Figure 11.9 shows one convention used for spherical polar coordinates, where the 
point P(x, у, 2) has equivalent polar coordinates P(p, ф, 0), where: 


x = p sin ġ cos 0 

y = psing sin 

z = p cos ġo 

р = уз? +y +z 
ф = arccos(z/p) 

9 = arctan(y/x) 
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Fig. 11.9 Spherical polar 
coordinates 


For example, the point (3, 4, 0) has spherical polar coordinates (5, 90°, 53.13?): 


р= ү32 + 42 +0 = 5 
ф = агссоѕ(0/5) = 90° 
Ө = arctan(4/3) = 53.13° 
Take great care when using spherical coordinates, as authors often swap ф with 0, as 


well as the alignment of the Cartesian axes; not to mention using a left-handed axial 
system in preference to a right-handed system! 


11.9 Cylindrical Coordinates 


Figure 11.10 shows one convention used for cylindrical coordinates, where the 
point P(x, y, z) has equivalent cylindrical coordinates P(p, 0, z), where: 


Fig. 11.10 Cylindrical 
coordinates 


Р, 9,2)ЕРбь у, 2) 
Lace 


7 >y 
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x = рсов0 
y=psing 
2-2 


о = ү? + у? 


9 = arctan(y/x) 


For example, the point (3, 4, 6) has cylindrical coordinates (5, 53.13°, 6): 


о= 32 +42 = 5 
Ө = arctan(4/3) = 53.13? 
2-6 


Again, be careful when using cylindrical coordinates to ensure compatibility. 


11.10 Barycentric Coordinates 


Barycentric coordinates locate a point in space relative to existing points, rather than 
to an origin, and are also known as local coordinates. The German mathematician 
August Mobius (1790-1868), is credited with their invention. 

Given two points in 2D space А(ха, уа) and В(хь, yp), an intermediate point 
Р(х, Ур) has barycentric coordinates: 


Хр = SXa + fxp 
Ур = Sya + typ 
l=s+t 


For example, when s = t = 0.5, P is halfway between A and B. In 3D, we simply 
add а z-coordinate. Given three points in 3D space: A (xa, Уа, Za), B(xp, yp, Zp) and 
C(Xc, Ус, Zc), then any point P (xp, ур, Zp) inside the triangle A BC has barycentric 
coordinates: 


Хр = Гха t+ SXp + 1% 
Ур = rya + SYb + fyc 
Zp = үа + SZp + {с 
l=r+s+t 


For example, when r = 1, s = t = 0, then Р = A. Similarly, whens = 1, r = t = 
0, then P = B. When r = s = t = 1/3, then P is located at the triangle’s centroid. 
If you want to know more about barycentric coordinates, refer to the author's book 
Mathematics for Computer Graphics (Vince 2022). 
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11.11 Homogeneous Coordinates 


Homogeneous coordinates surfaced in the early 19th century where they were inde- 
pendently proposed by Mobius, Feuerbach, Bobillier, and Pliicker. Homogeneous 
coordinates, define a point in a plane using three coordinates instead of two. This 
means that for a point (x, y) there exists a homogeneous point (xt, yt, t) where t 
is an arbitrary number. For example, the point (3, 4) has homogeneous coordinates 
(6, 8, 2), because 3 = 6/2 and 4 = 8/2. But the homogeneous point (6, 8, 2) is 
not unique to (3, 4); (12, 16, 4), (15, 20, 5) and (300, 400, 100) are all possible 
homogeneous coordinates for (3, 4). 

The reason why this coordinate system is called homogeneous is because it is 
possible to transform functions such as f(x, y) into the form f(x/t, y/t) without 
disturbing the degree of the curve. To the non-mathematician this may not seem 
anything to get excited about, but in the field of projective geometry it is a very 
powerful concept. 

In 3D, a point (x, у, z) becomes (xf, yt, zt, t) and for may applications t = 1, 
which seems a futile operation, but in matrix theory it is very useful, as we will 
discover. 


11.12 Worked Examples 


11.12.1 Area of a Shape 


Compute the area and orientation of the shape defined by the coordinates in Table 
11:2. 


Solution 


1 
area = 5[(2x 2-0 x 2) + (2x2-2x1)+(1x1-2x1)+(1x1-—1x0)] 
1 
= —(4+2—1+1) 
2 
=3 
The shape is oriented anticlockwise, as the area is positive. 


Table 11.2 Coordinates of the shape 


uM 0 2 2 1 1 O0 
у 0,02 2 1,1 
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11.12.2 Distance Between Two Points 


Find the distance dj? between P;(1, 1) and P5(6, 7), and азд between P3(1, 1, 1) 
and P4(7, 8, 9). 


Solution 


di; = /(6— 1? + (7 — 1? = v61 я 7.81 
dy = y (1 — 1? + (8 – 1)2 + (9 — 1? = V149 = 1221 


11.12.3 Polar Coordinates 


Convert the 2D polar coordinates (3, 2/2) to Cartesian form, and the point (4, 5) 
to polar form. 


Solution 
p=3 
0 = л/2 
x = pcos = 3 соѕ(л/2) = 0 
у = р sin = Ззщ(л/2) = 3 
Therefore, (3, 7/2) = (0, 3). 
x=4 
y=) 


о = үх + у? = /4? + 52 x 64 
Ө = arctan(y/x) = arctan(5/4) = 51.34? 


Therefore, (4, 5) ғ (6.4, 51.34°). 


11.12.4 Spherical Polar Coordinates 


Convert the spherical polar coordinates (10, 2/2, 45°) to Cartesian form, and the 
point (3, 4, 5) to spherical form. 
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Solution 
0-10 
ф=л/2 
9 = 45° 


x = psin$ cos = 105іп(л/2) cos 45° = 10-/2/2 = 7.07 
у = psingsin@ = 10 sin(x /2) sin 45° = 104/2/2 = 7.07 
z = pcos¢ = 10сов(л/2) = 0 


Therefore, (10, 1/2, 45°) ^ (7.07, 7.07, 0). 


x=3 
y=4 
2-23 


о = /х? + у? + :2 = /3? + 42 + 52 = 7.07 
ф = arccos(z/p) А arccos(5/7.07) = 45° 
Ө = arctan(y/x) = arctan(4/3) ~ 53.13? 


Therefore, (3, 4, 5) & (7.07, 45?, 53.13?). 


11.12.5 Cylindrical Coordinates 


Convert the 3D cylindrical coordinates (10, 2/2, 5) to Cartesian form, and the point 
(3, 4, 5) to cylindrical form. 


Solution 
p=10 
0 = л/2 
КО 


x = pcosé = 10сов(л/2) = 0 
y = psin0 = lOsin(z/2) = 10 
z=5 


Therefore, (10, 7/2, 5) © (0, 10, 5). 
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Table 11.3 Barycentric coordinates 


Point 7 5 ў 
А 1 0 0 
В 0 1 0 
С 0 0 1 

РУР 0.5 0.5 0 


о =У3? +42 = 5 
Ө = агсіап(4/3) = 53.13? 
25-2 


Therefore, (3, 4, 5) ~ (5, 53.13°, 5). 


11.12.6 Вагусепігіс Coordinates 


Given А(ха, Уа, Za), В(хь, Yo, Zp) and С(хе, ус, Zc), state the barycentric coordi- 
nates for A, В, C the points Pab, Pj. and Pea mid-way between AB, BC and СА 
respectively, and the centroid Ро. 


Solution 
Table 11.3 shows the barycentric coordinates for ABC. 


References 


Boyer C, Merzbach U (2011) A history of mathematics. John Wiley & Sons 
Vince J (2022) Mathematic for computer graphics, 6th edn. Springer 


Chapter 12 ®) 
Curves Check for 


12.1 Introduction 


This chapter is about the four curves of the conic sections, and concludes with four 
worked examples. 


12.2 Conic Sections 


The conic sections comprise the circle, ellipse, hyperbola and parabola, which result 
from a plane intersecting a cone. 


12.2.1 Circle 


A circle results when the cutting plane is horizontal to the cone as illustrated in Fig. 
12.1. Equation (12.1) shows a circle of radius ғ and centre (xe, ус): 


(х= х) + (у— у)? = 2 (12.1) 


Figure 12.2 shows a circle with a radius of 2, and equation x? + y? = 4. 


12.2.2 Ellipse 


An ellipse results when the cutting plane is at an angle to a cone as illustrated in Fig. 
12.3. Equation (12.2) shows an ellipse of radius ғ, and ғ, and centre (хо, ус): 
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Fig. 12.1 А circle - 
1 


Fig. 12.2 A circle 


> 


Fig. 12.3 An ellipse о 


(х = Xe)? (y a уг)? - 


2. 2 
Lis ry 


1 (12.2) 


The x-width of an ellipse is 2r,, and the y-width is 2r,. Figure 12.4 shows an ellipse 
with an equation (x/4)? + (y/2)? = 1. 
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Fig. 12.4 An ellipse я 


Fig. 12.5 А parabola 


Fig. 12.6 А parabola | 


№‘= 


12.2.3 Parabola 


A parabola results when the cutting plane is parallel with one side of the cone as 
illustrated in Fig. 12.5. Equation (12.3) shows a parabola of centre (xe, ус): 


(x — x? = Ap(y — yc) where p 4 0 (12.3) 


The focus is (x, + p, уг). Figure 12.6 shows a parabola with an equation x? = 4y. 
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Fig. 12.7 А hyperbola 


Fig. 12.8 A hyperbola pu yt 


12.2.4 Hyperbola 


A hyperbola reults when the cutting plane cuts both cones as illustrated in Fig. 
12.7. Equation (12.4) shows a hyperbola of centre (хе, уг) where 2а is the smallest 
distance between the curves: 


оу re) 
(x 259 200 229 _ 1 (12.4) 


Figure 12.8 shows a hyperbola with an equation (x /1)? — (у/1.5)? = 1. The broken 
straight lines are the asymptotes for the hyperbola, and the four points are marked 
with a = | and b = 1.5. The equations for the asymptotes are y = (b/a)x and y = 
—(b/a)x. 
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12.3 Worked Examples 


12.3.1 Circle 


Give the equation of a circle displaced (—1, 0), and r = 2, and draw the result. 


Solution 
GE +y =4 


Figure 12.9 shows the solution. 


12.3.2 Ellipse 


Give the equation of an ellipse displaced (—1, 0), and a horizontal width 6, and a 
vertical width of 4, and draw the result. 


Solution 

GAD! Y, _, 

x» += 
Figure 12.10 shows the solution. 
Fig. 12.9 A circle displaced yt 
(—1, 0) andr = 2 

4 2 3 kx 
-2 

Fig. 12.10 An ellipse yl 


displaced (—1, 0), with a 
horizontal width of 6, and a 
vertical width of 4 
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12.3.3 Parabola 


Give the equation of a parabola displaced (—1, 0) and p — 2. Draw the result. 


Solution 
(x + 1)? = 8у 


Figure 12.11 shows the solution. 


12.3.4  Hyperbola 


Give the equation of a hyperbola displaced (1, 0), with a = 1 and b = 2. Draw the 
result. 


Solution 
= 1 


елу у 
1 4 


Figure 12.12 shows the solution. 


Fig. 12.11 A parabola 
displaced (—1, 0) and p = 2 


Fig. 12.12 A hyperbola 
displaced (1, 0), а = 1 and 
b=2 


MCab) | (ab) |” 


„Я Cab) | (а-ЫҚ 


Chapter 13 A) 
Determinants б 


13.1 Introduction 


This chapter introduces the determinant as a mathematical construct that simplifies 
the solution of groups of simultaneous equations. The chapter begins by tracing the 
determinant’s historical development, and the reader is shown how to evaluate the 
determinant’s magnitude for real and complex values. The chapter concludes with 
some worked examples. 


13.2 Background 


When patterns of numbers or symbols occur over and over again, mathemati- 
cians often devise a way to simplify their description and assign a name to them. 


For example: 
4 
о 
I] Р; 
і=1 


is shorthand for: 
Ор Q5 Qa 04 
Pı P2 P3 P4 


and 


4 
о 
Di 
i=1 


is shorthand for: 
Pi + py + ps + pi 
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A determinant is another example of this process, and is a value derived from a 
square matrix of terms, often associated with sets of equations. Such problems were 
studied by the Babylonians around 300 BC and by the Chinese, between 200 BC 
and 100 BC. Since then many mathematicians have been associated with the evolu- 
tion of determinants and matrices, including Girolamo Cardano (1501-1576), Jan de 
Witt (1625-1672), Takakazu Seki (1642-1708), Gottfried von Leibniz, Guillaume 
ае L'Hópital (1661-1704), Augustin-Louis Cauchy (1789-1857), Pierre Laplace 
(1749-1827) and Arthur Cayley (1821-1895). To understand the rules used to com- 
pute a determinant’s value, we need to understand their origin, which is in the solution 
of sets of linear equations. 


13.3 Linear Equations with Two Variables 


Consider the linear equations with two variables (13.1) and (13.2) where we want 
to find values of x and y that satisfy both equations: 


7 = Зх +2y (13.1) 
10 = 2x + 4y (13.2) 


A standard way to resolve this problem is to multiply (13.1) by 2 and subtract (13.2) 
from (13.1), which removes the y-terms: 


14=6x+4y 
10 = 2x + 4y 
4 = 4x 
x=1 


Substituting x = 1 in (13.1) reveals the value of y: 


7=3+42y 
4=2y 
y-2 


Therefore, x — 1 and y — 2, solves (13.1) and (13.2) as follows: 


7=3x1+2x2 
10=2х1+4х2 
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The equations must be linearly independent, otherwise we only have one equation. 
For example, starting with: 


7=3x+2y 
14=6x+4y 


is a futile exercise, as the second equation is double the first, and does not provide 
any extra information. 
To find a general solution to this problem, we start with (13.3) and (13.4): 


dı = ах + biy (13.3) 
4 = ax + bay (13.4) 


To remove the y-term, we multiply (13.3) by bz and (13.4) by Бі: 


ар = ах + Һу (13.5) 
bid, = Вах + Һу (13.6) 


Next, we subtract (13.6) from (13.5): 


415» = БФ = abx = bya yx 
(ay bz — bia2)x 
_ dib — bid» 


= ———— 13.7 
* ab = biaz ( ) 


To find y, we multiply (13.3) by a» and (13.4) by ay: 


dia» = aax + разу (13.8) 
а142 = Фах + ау (13.9) 


Next, we subtract (13.8) from (13.9): 


aid — dia = ajb3y — Бау 
= (aib — b1a3)y 
аж = diaz 


LI 13.10 
4 a1 b» = biaz ( ) 


Observe that both (13.7) and (13.10) share the common denominator a,b» — bya). 
Furthermore, note the positions of a,, b1, a» and b» in the original equations: 


a, bi 


аз b 
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and the denominator is formed by cross-multiplying the diagonal terms ар» and 
subtracting the other cross-multiplied terms 51a». Placing the four terms between 
two vertical lines creates a second-order determinant whose value equals: 


Although the name was originally given by Carl Gauss, it was the French mathe- 
matician Augustin-Louis Cauchy who clarified its current modern identity. 

If the original equations were linearly related by a factor А, the determinant equals 
Zero: 


aj by 


Ха Abi 


= а/ы = bidai =0 


Observe that the numerators of (13.7) and (13.10) are also second-order determinants: 


а м а — bid; 
а bo 

and 
ш ш = aidz = diaz 
аҙ Ф 


which means that Eqs. (13.7) and (13.10) can be written using determinants: 


а bi 
4 by 
X = -—— 
ај bi 
a» bz 
а-а 
а? d» 
y = 
dı bi 
a2 bz 
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And one final piece of algebra permits the solution to be written as: 


1 
RT ANDES (13.11) 


di b, а а, а bi 
d» bz a2 d» a» bz 


Observe another pattern in (13.11) where the determinant is: 


а bi 


a b 


but the d-terms replace the x-coefficients: 


а b 
а b 
and then the y-coefficients: 
ај 4 
a» а, 
Returning to the original equations: 
7=3x+2y 
10 = 2x + 4y 


and substituting the constants in (13.11), we have: 


10 4 2 10 2 4 


which, when expanded reveals: 


X _ у _ 1 
28—20 30—14 12-4 
xy _1 
8 16 8 


making x = 1 and y = 2. 
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11= 4х + у (13.12) 
5=х + у (13.13) 


and substituting the constants іп (13.11), we have: 


which, when expanded reveals: 


x _ у _ 1 
11-5 20-1 4-1 
x y 1 
6 9 3 


Therefore, x — 2 and y — 3, solves (13.12) and (13.13) as follows: 


11=4х2+1х 3 
5=1х2+1х3 


Now let’s see how a third-order determinant arises from the coefficients of 
three equations in three unknowns. 


13.4 Linear Equations with Three Variables 


Consider the linear equations with three variables (13.14), (13.15) and (13.16) 
where we want to find the values of x, y and z that satisfy the three equations: 


13 = 3x + 2у + 22 (13.14) 
20 = 2х + 3y + 45 (13.15) 
7= 2х +у += (13.16) 


One way to resolve this problem is to multiply (13.14) by 2 and subtract (13.15), 
which removes the z-terms: 


26 = 6x +4y + 4z 
20 = 2x + 3y + 45 
б= 4х +у (13.17) 
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leaving (13.17) with two unknowns. Next, we take (13.15) and (13.16) and remove 
the z-term by multiplying (13.16) by 4 and subtracting (13.15): 


28 = 8x +4y + 4z 
20 = 2x + 3y + 4: 
8=6x+y (13.18) 


leaving (13.18) with two unknowns. We are now left with (13.17) and (13.18): 


6=4x+y 
8 = бх + у 
which can be solved using (13.11): 
x _ у " 1 
6 1 4 6 1 
8 1 6 8 1 
Therefore: 
6-8 
Х-----і 
4-6 
32-36 
= = 2 
4-6 


Substituting x = 1 and у = 2 in (13.14) reveals that z = 3. Therefore, x = 1, у = 2 
and z = 3 solves (13.14) (13.15) and (13.16) as follows: 


13=3х1+2х2+2х 3 
20=2х1+3х2+4х 3 
7-2х1--1х2--1х3 


We сап generalise (13.11) for three equations using third-order determinants: 


x y _ 2 _ 1 
d by с, a d c а by d a b сі 
d b о a Фф © аз b Фф аз b о 
аз Рз сз аз 43 сз аз з dà аз Рз сз 


(13.19) 
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Once again, there is an important pattern in (13.19) where the underlying determinant 
is: 


а by с 
а b о 
аз Ёз c3 


but the d-terms replace the x-coefficients: 


di bi СІ 
d by о 
di Ёз c3 


the d-terms replace the y-coefficients: 


a dj c 
а d» о 
аз dz сз 


and the d-terms replace the z-coefficients: 


ај bi di 
a» bo d» 
a3 Рз d; 


We must now find a way of computing the value of a third-order determinant, 
which requires the following algebraic analysis of three equations in three unknowns. 
We start with three linear equations: 


а = ах + biy + сг (13.20) 
d» = арх + boy + coz (13.21) 
аз = азх + bay +632 (13.22) 


and derive one equation in two unknowns from (13.20) and (13.21), and another 
from (13.21) and (13.22). 
We multiply (13.20) by c2, (13.21) by c, and subtract them: 


оа = aicax + русу + с1с25 
с\й» = сах + bocyy + с1с25 
о — с\й; = (a1c5 — слаз)х + (с; — Ь›с\)у (13.23) 


13.4 Linear Equations with Three Variables 
Next, we multiply (13.21) by сз, (13.22) Бу c» and subtract them: 


сзф = азсзх + Босзу + сосз2 
саз = азсох + Ьзсоу + сосз2 
сз - €2d3 = (азсз — азсо)х + (Босз — b3c2)y 


Simplify (13.23) by letting: 


е = са = с\й» 
Л = ас; — с1а 


gi = bic — осі 
Therefore: 

ег = fix t giy 
Simplify (13.24) by letting: 


a= сзФ = саз 
fa = а2сз — азс> 


82 = Росз — b3c2 
Therefore: 
е = fox + gay 


Now we have two equations in two unknowns: 


ei = fix + 81у 
е = fox + 82у 
which are solved using: 
X y. 1 
A B С 
where: 
ер 81 _ | edi = аф Вс — baci 


€) 82 сз — сойз b2c3 — bsc? 
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(13.24) 


(13.25) 


(13.26) 


(13.27) 


(13.28) 


224 13 Determinants 


B= fi еі = dCi = C102 cod = с\й» (13.29) 
Б е азсз — азс> C3d» — C23 

C= fi 81 = а1с2 — са» bicz — baci (13.30) 
R 82 a2C3 - a3C2 Босз — bac» 


We first compute A, from which we can derive B, because the only difference between 
(13.28) and (13.29) is that dı, do, d3 become a), a», as respectively, and b1, b2, b3 
become di, d2, d3 respectively. 

We can derive C from A, as the only difference between (13.28) and (13.30) is 
that dı, d», аз become a), a», аз respectively. Starting with A: 


А = (оа — сі) (boc — b3c5) — (русо — baci) (cad» — саз) 
= Бососзаи — bacidi — Босісзф + b3cicado 
— Русәсзй» + byc3d3 + Бостсз4> — bacicada 
= Бососзаи — bac2di + Взс1с24 — ӘісосзФ + bic2da — bzc1c2d3 
= cx(b5cadi — bzc2dı + Бзс1 42 — by c3dz + Буса; — Боса) 


A= со (а (росз - с2Ьз) — bi(doca — сой) + cy (dob3 — bod.) (13.31) 
Using the substitutions described above we can derive B and C from (13.31): 


B= со (аа (docs — соз) — bı (а2сз — саз) + cı (a2d3 — Фаз) (13.32) 


С = со (аи (bres — с26з) — by (а2сз — саз) + ci(aobs — bas) (13.33) 


We can now rewrite (13.31), (13.32) and (13.33) using determinant notation. At the 
same time, we can drop the c» terms as they cancel out when computing x, y and z: 


qa? | 4 а # (13.34) 
ӛз C3 аз C3 аз Рз 

panl ажалға ат T A (13.35) 
da C3 аз C3 a3 аз 

conl 2 Glan аа т (13.36) 
ӛз сз аз сз аз b3 
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As (13.19) and (13.27) refer to the same x and y, then: 


di bi СІ b d d b 

à b eled ^ ?|-ы 2 2 |+а| 2 № (13.37) 

РЧ b3 сз di сз d3 Ъз 

a1 di СІ d 4 

“ah eile? “=a Seal dn (13.38) 
d3 сз аз сз аз ау 

аз ds сз 

аі bi СІ b " 

m bece eme). T и (13.39) 
b3 сз аз сз аз ӛң 

аз Ёз сз 


As а consistent algebraic analysis has been pursued to derive (13.37), (13.38) and 
(13.39), a consistent pattern has surfaced in Fig. 13.1 which shows how the three 
determinants are evaluated. This pattern comprises taking each entry in the top row, 
called a cofactor, and multiplying it by the determinant of entries in rows 2 and 
3, whilst ignoring the column containing the original term, called a first minor. 
Observe that the second term of the top row is switched negative, called an inversion 
correction factor. 


d bi «a 
d b о | = 
аз b3 оз 
а di а 
a Фф о | = 
аз a сз 
a b «a 
a b о | = 
аз b сз 


Fig. 13.1 Evaluating the determinants shown in (13.37), (13.38) and (13.39) 
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Let's repeat (13.33) again without the c» term, аз it has nothing to do with the 
calculation of the determinant: 


С = ai (b2c3 - сӛз) — bı (а2сз — c2a3) + cı (a2b3 — b2a3) (13.40) 


It is possible to arrange the terms of (13.40) as a square matrix such that each row 
and column sums to C: 


a1(b5c3 — c2b3) — bı (a2c3 - саз) + cı (a2b3 — Баз) 
— а> (сз — сіз) + Бо(аусз — c1a3) — c2(a1b3 — biaz) 
a3(b1c5 — c1b3) — ba(a1c5 — c1a2) + €3(a,b2 — Буаз) 


which means that there are six ways to evaluate the determinant C: summing the rows, 
or summing the columns. Figure 13.2 shows this arrangement with the cofactors in 
blue, and the first minor determinants in green. Observe how the signs alternate 
between the terms. 

Having discovered the origins of these patterns, let’s evaluate the original equa- 
tions declared at the start of this section using (13.11): 


Fig. 13.2 The patterns of multipliers with their respective second-order determinants 
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13 = 3x + 2у + 2z 
20 = 2х + 3y + 4: 


7= 2х +у + 

х _ y 2 = 1 
di bj с a, а c а bj di a b cy 
dj bo о a, d о а bo Ф a b о 
ау Рз сз аз 43 сз аз з dà аз Рз сз 

Therefore: 
x y 2 _ 1 

13. 2 2 7 3 13 2 р 32 13 7 3 2 

20 3 4 2 20 4 2 3 20 2 3 

7 1 1 2 7 1 217 2 1 1 


computing the determinants using the top row entries as cofactors: 


X _ у _ 2 " 1 
—134-16—2  —24478—52 34-52-52 -3-12-8 


Therefore, x = 1, y = 2 and z = 3 solves (13.14), (13.15) and (13.16) as follows: 


13=3х1+2х2+2х 3 
20=2х1+3 х2 +4х 3 
7=2х1+1х2+1х 3 


13.4.1 Sarrus’s Rule 


The French mathematician Pierre Sarrus (1798-1861), discovered another way, now 
called Sarrus's rule, to compute the value of a third-order determinant, that arises 
from (13.40): 


C = а (росз — c3b3) — by (а2сз — c5a3) + сі (а2Ьз — Боаз) 


= ayboc3 — а1с26з — Буазсз + b1c5a3 + c1a5b3 — c1 b203 


= ajboc3 + bic2a3 + cqa5b3 — аусәЬз — b1a5c3 — cib3a3 (13.41) 
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Fig. 13.3 The pattern 


behind Sarrus’s rule a by сі 


а b о | = 


аз Ёз сз 


The pattern in (13.41) becomes clear in Fig. 13.3, where the first two columns 
of the matrix are repeated, and comprises two diagonal sets of terms: on the left in 
blue, we have the products a; b2c3, b1c2a3, cia» b3, and on the right in red and orange, 
the products а1с26з, b1a2c3, c1b2a3. These diagonal patterns provide a useful aide- 
mémoire when computing the determinant. Unfortunately, this rule only applies to 
third-order determinants. 


13.5 Mathematical Notation 


Having discovered the background of determinants, now let's explore a formal 
description of their structure and characteristics. 


13.5.1 Matrix 


In the following definitions, a matrix is a square array of entries, with an equal 
number of rows and columns. The entries may be numbers, vectors, complex numbers 
or even partial differentials, in the case of a Jacobian. In general, each entry is 
identified by two subscripts row col: 


Grow col 


A matrix with n rows and m columns has the following entries: 


ай ајр .. ат 
921 022 ... Am 
аһ 464,2... Anm 


The entries lying on the two diagonals are identified as follows: а, and а,, lie on 
the main diagonal, and aim and a, lie on the secondary diagonal. 
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13.5.2 Order of a Square Determinant 


The order of a square determinant equals the number of rows or columns. For 
example, a first-order determinant contains a single entry; a second-order determinant 
has two rows and two columns; and a third-order determinant has three rows and 
three columns. 


13.5.3 Value of a Determinant 


A determinant posses a unique, single value derived from its entries. The algorithms 
used to compute this value must respect the algebra associated with solving sets of 
linear equations, as discussed above. 

Pierre-Simon Laplace developed a way to expand the determinant of any order. 
The Laplace expansion is the idea described above and shown in Fig. 13.1, where 
cofactors and first minors or principal minors are used. For example, starting with 
a fourth-order determinant, when any row and column are removed, the remain- 
ing entries create a third-order determinant, called the first minor of the original 
determinant. 

The following equation is used to control the sign of each cofactor: 


(= jy oes 
which, for a fourth-order determinant creates: 


+ - + - 
- +- + 
+- +- 
- +- + 


The Laplace expansion begins by choosing a convenient row or column as the source 
of cofactors. Any zeros are particularly useful, as they cancel out any contribution by 
the first minor determinant. It then sums the products of every cofactor in the chosen 
row or column, with its associated first minor, including an appropriate inversion 
correction factor to adjust the sign changes. The final result is the determinant’ s 
value. 

A first-order determinant: 


ап | -ап 
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A second-order determinant: 


а 412 | _ 
= 411422 — 412421 


421 422 


A third-order determinant using the Laplace expansion with cofactors from the 
first row: 


ай ау dj3 


Sä 922 A23 m 921 923 +a a» 922 
a2) a22 a23 = 11 12 13 


432 азз a31 033 азр аз 
аз 432 азз 


A fourth-order determinant using the Laplace expansion with cofactors from 
the first row: 


an 023 024 2] G23 424 
ап | аз азз аза | %2 азі аз аз + 
ао d43 ам алу адз айм 
ац 412 а; 414 
азу an 94 d» an аз 
д E [44 an az UA 
13| азр аз азд 14 аз аз азз |7 
азі 3) азз азд 
ай ао ад ад ар ад 


ад a4. 443 ад 


Sarrus’s rule is useful to compute а third-order determinant: 


ай 42 413 
921 423 an | 7011022033 + 412423431 + 413421432— 
931 @32 433 

411423432 — 412421433 + 413422431 


The Laplace expansion works with higher-order determinants, as any first minor 
can itself be expanded using the same expansion. 
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13.5.4 Properties of Determinants 


We now summarise various properties of determinants. For example, if a determi- 


nant contains a row or column of zeros, the Laplace expansion implies that the value 
of the determinant is zero: 


ооо 
2 10 4|=0 
2-81 
302 
2 0 4|=0 
2 0 1 


If a determinant’s rows and columns are interchanged, the Laplace expansion also 
implies that the value of the determinant is unchanged: 


3 12 2 з. 19.79 
2 10 4|=|12 10 8 |--2 
2 8 1 2 4 1 


If any two rows, or columns, are interchanged, without changing the order of their 
entries, the determinant’s numerical value is unchanged, but its sign is reversed: 


3 12 2 
2 10 4|--2 
2 8 1 
12 3 

10 2 =2 
8 2 1 


If the entries of a row or column share a common factor, the entries may be adjusted, 
and the factor placed outside: 
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13.6 Worked Examples 


13.6.1 Determinant Expansion 


Evaluate this determinant using the Laplace expansion and Sarrus’s rule: 


14 7 
2 5 8 
369 
Solution 
Using the Laplace expansion: 
14 7 
7 
2 5 8 |=1 =2 +3 
6 9 9 8 
369 
= 1(45 — 48) — 2(36 — 42) + 3(32 — 35) 
=-—3+ 12-9 
=0 


Using Sarrus’s rule: 


=1x5x9+4x8x34+7x2x6-7x5x3-1x8x6-4x2x9 


oU N 
ON мл A 
© 00 м 


= 45 + 96 + 84 — 105 — 48 — 72 
=0 


13.6.2 Complex Determinant 


Evaluate the complex determinant: 


4-2 1-і 
2-31 3+31 
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Solution 
Using the Laplace expansion: 


ace Та. 
225 3+3i 
= (12+1#—6)—(2—1+3) 
=6+18i—5+i 
=1+19% 


13.6.3 Simple Expansion 


Write down the simplest expansion of this determinant with its value: 


an бє 
м ч ою 
o о о 


Solution 
Using the Laplace expansion with cofactors from the third column: 


13.6.4 Simultaneous Equations 


Solve the following equations using determinants: 


3=2x+y-z 
12-х--2у--2 
8 = 3x — 2y + 2z 


234 13 Determinants 


Solution 
Using (13.19): 


x _ у _ 2 _ 1 
з 1 -l1 2 3 -1 2 1 3 2 1 -1 
12 2 1 1 12 1 1 2 12 1 2 1 
8 -2 2 3 8 2 3-2 8 3 -2 2 
Therefore: 
3. L =l 
12 2 1 
- 8 -2 2 о о sp 
2 1 ц 124-148 21 
1 2 1 
3 -2 2 
2 3 -1 
12 1 
= з 8 2 в. 
2 1 4 12 +1+8 21 
1 2 1 
3 -2 2 
2 1 3 
2 12 
- 3 -2 8 _ 50+28—24__&1_ 
b 2 1 24+1+8 21 
1 2 1 
3-2 2 


Therefore, x = 2, y = 3 and z = 4, which satisfy the original equations as follows: 


3=2х2+1х3- 1х4 
12 = 1х2+2х3+1х4 
8-3х2-2х3--2х4 


Chapter 14 A) 
Vectors Check for 


14.1 Introduction 


This chapter provides a comprehensive introduction to Euclidean vectors. It covers 
a description of vector space, basis vectors, dimension, 2D and 3D vectors, unit 
vectors, position vectors, Cartesian vectors, vector magnitude, vector products, and 
area calculations. The chapter concludes with some worked examples. 


14.2 Background 


Vectors are a relative new invention in the world of mathematics, dating only from 
the 19th century. They enable us to solve complex geometric problems, the dynamics 
of moving objects, and problems involving forces and fields. 

We often only require a single number to represent quantities used in our daily 
lives such as height, age, shoe size, waist and chest measurement. The magnitude 
of these numbers depends on our age and whether we use metric or imperial units. 
Such quantities are called scalars. There are some things that require more than one 
number to represent them: wind, force, weight, velocity and sound are just a few 
examples. For example, any sailor knows that wind has a magnitude and a direction. 
The force we use to lift an object also has a value and a direction. Similarly, the 
velocity of a moving object is measured in terms of its speed (e.g. miles per hour), 
and a direction such as north-west. Sound, too, has intensity and a direction. These 
are represented by vectors. 

Complex numbers seemed to be a likely candidate for representing forces, and 
were being investigated by the Norwegian-Danish mathematician Caspar Wessel 
(1745-1818), the French amateur mathematician Jean-Robert Argand (1768-1822) 
and the English mathematician John Warren (1796-1852). At the time, complex 
numbers were two-dimensional, and their 3D form was being investigated by the 
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Irish mathematician Sir William Rowan Hamilton (1805-1865), who discovered 
them in 1843, calling them quaternions. In 1853, Hamilton published his book 
Lectures on Quaternions (Hamilton 1853) in which he described names such as 
vector, transvector and provector. Hamilton’s vectors were not widely accepted 
until in 1901, when the American mathematician Edwin Bidwell Wilson (1879- 
1964), published Vector Analysis (Wilson 1901), describing modern vector analysis. 
This was based upon a series of lectures delivered earlier by the American scientist 
Josiah Gibbs (1839-1903). 

Gibbs was not a fan of the imaginary quantities associated with Hamilton’s quater- 
nions, but saw the potential of creating a vectorial system from the imaginary i, j and 
k into the unit basis vectors i, j and k, which is what we use today. 

Some mathematicians were not happy with the direction mathematics had taken. 
The German mathematician Hermann Gunther Grassmann (1809-1877), believed 
that his own geometric calculus was far superior to Hamilton’s quaternions, but 
he died without managing to convince any of his fellow mathematicians. Fortu- 
nately, the English mathematician and philosopher William Kingdon Clifford (1845- 
1879), recognised the brilliance of Grassmann’s ideas, and formalised what today 
has become known as geometric algebra. 

With the success of Gibbs’ vector analysis, quaternions faded into obscurity, only 
to be rediscovered in the 1970s when they were employed by the flight simulation 
community to control the dynamic behaviour of a simulator’s motion platform. A 
decade later they found their way into computer graphics where they are used for 
rotations about an arbitrary axis. 

Now this does not mean that vector analysis is dead—far from it. Vast quantities of 
scientific software depends upon the vector mathematics developed over a century 
ago, and will continue to employ it for many years to come. Nevertheless, geometric 
algebra is destined to emerge as a powerful mathematical framework that could 
eventually replace vector analysis one day. 

Readers interested in the history of vector analysis should read Michael Crowe’s 
book A History of Vector Analysis (Crowe 1994). 


14.3 Vectors 


As mentioned above, vectors owe their invention to Hamilton’s discovery of quater- 
nions, which are four-dimensional objects, composed of a scalar and three imaginary 
terms i, j and К. Gibbs saw that the imaginary component could represent 3D quan- 
tities, and by using і and j, 2D quantities. Since then, vectors have been embraced 
by mathematicians, and adjusted to become a part of linear algebra. 
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14.3.1 Vector Space 


A vector space is the set of vectors that satisfy the axioms of this space, that often 
include being able to add and scale vectors. For this chapter, all vectors are Euclidean 
vectors which are used to represent force, velocity, surface normals, etc. 


14.3.2 Basis Vectors 


A basis is the set of vectors in a vector space that is used to define other vectors. 
Basis vectors are at the heart of any vectorial system, and ultimately determine the 
value of a vector’s components. For example, the standard basis used in this chapter, 
defines a 2D vector by two basis vectors е) and еҙ, where: 


e; = (1, 0) 
е = (0, 1) 


and а 3D vector by three basis vectors е), e; and ез, where: 


e; = (1, 0, 0) 
е; = (0, 1, 0) 
ез = (0, 0, 1) 


We visualise these basis vectors being mutually perpendicular such that e; is the 
x-axis, еҙ is the y-axis, and еҙ is the z-axis. These basis vectors permit any 2D vector 
v = (a, b) to be written uniquely as: 
у = ae, + be; 
and any 3D vector w = (a, b, c) to be written as: 
w = ае; + bez + сез 


For example, given u = (1, 2, 3) and w = (3, 4, 5), then: 


и = le; + 2e? + 3e; 
w = Зе; + 4е› + 563 

u +w = (1 + 3)е + (2 + 4)е + (3 + 5)ез 
= 4e, + 6e, + 863 


which uniquely describes the vectors и, w and u + w. 
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Naturally, other sets of basis vectors are possible, such as: 


е = (2, 0, 0) 
e) = (0, 3, 0) 
ез = (0, 0, 4) 


and generate the following coordinates for the above example: 


и = 2e, + без + 12е» 
w = бе; + 12e + 20е» 
u м = (2+ 6e; + (6+ 12)е + (12 + 20)ез 
= 8e, + 18e; + 32е» 


but are unnecessary for our work. 
Later in this chapter, I employ different names for the basis vectors. Instead of 
using ei, ез and еҙ, I use i, j and К. 


14.3.3 Dimension of a Vector Space 


A vector space determines the dimension of the associated vectors by controlling 
the number of elements required to define a vector. When a vector requires 2 or 3 
elements, we can visualise them as two- and three-dimensional structures, but four 
dimensions, and more, are impossible to visualise. 


14.3.4 Vector Notation 


As a vector contains two or more numbers, its symbolic name is printed either using 
a bold font, such as n, or with a normal font with an arrow, such as X.I prefer the 
bold font. 

When a scalar variable is assigned a value, we use the standard algebraic notation: 


x=5 


However, a vector has one or more numbers enclosed in brackets, written as a column 
ог as а row-in this text column vectors are used: 
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A row vector places the components horizontally: 
п = [2 3] 


The difference between the two, is appreciated in the context of matrices. Sometimes 
it is convenient-for presentation purposes-to write a column vector as a row vector, 
in which case, it is written: 

п = [2 3]T 


where е superscript T reminds us that n is really а transposed column vector. 
For this chapter, all vectors are assumed to be Euclidean vectors which have a 
standard basis. 


14.3.5 Graphical Representation of Vectors 


An arrow is used to represent a vector as it possesses length and direction, as shown in 
Fig. 14.1. By assigning coordinates to the arrow it is possible to translate the arrow's 
length and direction into two numbers. For example, in Fig. 14.2 the vector r has 
its tail defined by (хі, уі) = (1, 2), and its head by (x2, y2) = (3, 4). Vector s has 
its tail defined by (x3, y3) = (5, 3), and its head by (x4, y4) = (3, 1). The x- and 
y-components for r are computed as follows: 


Xr = X2 — Х| 


=3-1 
=2 

yr = Уз— yi 
=4-2 
=2 


Fig. 14.1 An arrow with y 
magnitude and direction 
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Fig. 14.2 Two vectors г and y 


s have the same magnitude 4 Qo V2) 
but opposite directions 
r (x3, у 3) 
3 
8 
2 Xy y) 
1 (х4, Уд) 
>= 
1 2 3 4 5 6 Ы 


The components for s are computed as follows: 


Xs = X4 — X3 
=3-5 
= —2 

У = Y4 Ууз 


It is the negative value of x, and у; that encode the vector's direction. In general, 
if the coordinates of a vector’s head and tail are (х,, у») and (х,, y;) respectively, 
its components Ax and Ay are given by: 


Ах = Xp — х; 
Ay = yn — у 
One can readily see from this notation that a vector does not have an absolute position. 


It does not matter where we place a vector, so long as we preserve its length and 
orientation. 


14.3.6 Magnitude of a Vector 


The magnitude or length of a vector r is written ||r|| and computed using the theorem 


of Pythagoras: 
Irl = V (Ax)? + (Ay)? 


and used as follows. Consider a vector defined by: 


14.3 Vectors 241 


Fig. 14.3 Eight vectors 
whose coordinates are shown 
in Table 14.1 


Nyx 


Table 14.1 Values associated with the eight vectors in Fig. 14.3 


Xh Yn Xr у Ах Ay [уесїог| 

р ооо 2 0 2 

0 2 0 0 0 p 2 
-9 0 о 0 > 0 2 

0 -2 0 0 0 -2 2) 

1 1 0 O0 1 1 A2 
-i i O0 @ =l í У? 
=| = 0 @ =i =i „2 

Оо i =i Va 


(ха, yn) = (4, 5) 
(а, y) = (1, 1) 


where the x- and y-components аге 3 апа 4 respectively. Therefore its magnitude 
equals 4/32 + 4? = 5. The magnitude of a vector is also written as |r|, with single 
vertical lines. 

Figure 14.3 shows eight vectors, and their geometric properties are listed in 
Table 14.1. 


14.3.7 3D Vectors 


The above vector examples are in 2D, but it is easy to extend this notation to embrace 
an extra dimension. Figure 14.4 shows a 3D vector r with its head, tail, components 
and magnitude annotated. The vector, its components and magnitude are given by: 
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Fig. 14.4 Тһе vector г has 
components Ax, Ay, Az a 


r=[Ax Ay Аг]? 
Ах = Xp —х, 
Ду = yn — У, 
AZ = Zn — Zt 
г! = (Ax)? + (Ay)? + (Az)? 


All future examples are three-dimensional. 


14.4 Vector Manipulation 


As vectors are different to scalars, there are rules to control how the two mathematical 
entities interact with one another. For instance, we need to consider vector addition, 
subtraction and products, and how a vector is scaled. 


14.4.1 Scaling a Vector 


Given a vector n, 2 п means that the vector's components are scaled by a factor of 2. 


For example, given: 
6 


3 

n—|4|, then 2n=|] 8 
5 10 

which seems logical. Similarly, if we divide n by 2, its components are halved. 


Note that the vector's direction remains unchanged-only its magnitude changes. In 
general, given: 
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ny Апі 
п = | и |, then An= | Ans where ЛЕК 
пз Апз 


There is no obvious way we can resolve the expression 2 + п, for it is not clear 
which component of n is to be increased by 2. However, if we can add a scalar to an 
imaginary (e.g. 2+31), why can’t we add a scalar to a vector (e.g. 2 + п)? Well, the 
answer to this question is two-fold: First, if we change the meaning of ‘add’ to mean 
‘associated with’, then there is nothing to stop us from ‘associating’ a scalar with 
a vector, like complex numbers. Second, the axioms controlling our algebra must 
be clear on this matter. Unfortunately, the axioms of traditional vector analysis do 
not support the ‘association’ of scalars with vectors in this way. However, geometric 
algebra does! Furthermore, geometric algebra even permits division by a vector, 
which does sound strange. Consequently, whilst reading the rest of this chapter keep 
an open mind about what is permitted, and what is not permitted. At the end of the 
day, virtually anything is possible, so long as we have a well-behaved axiomatic 
system. 


14.4.2 Vector Addition and Subtraction 


Given vectors г and $, г + $ is defined as: 


X, Xs Xr х; 


r—|y |. 8-|у;|, Шеп г--8- |y, + у; 


Zr 2; 2, + Zs 


a+b=b+a 
1 4 4 1 
еов.|2|-|5| = |51|-12 
3 6 6 3 


However, like scalar subtraction, vector subtraction is not commutative: 


a—bzb-a 
4 1 1 4 
e.g.| 5] — |2 £ 2|-|5 
6 3 3 6 


Let’s illustrate vector addition and subtraction with two examples. Figure 14.5 
shows the graphical interpretation of adding two vectors r and s. Note that the tail 
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Fig. 14.5 Vector addition y 
r+s 
rts 
8 
7. x 
Fig. 14.6 Vector subtraction PA 
r—s 


of vector s is attached to the head of vector r. The resultant vector t — r 4- s is 
defined by adding the corresponding components of r and s together. Figure 14.6 
shows a graphical interpretation for r — s. This time, the components of vector s are 
reversed to produce an equal and opposite vector. Then it is attached to r and added 
as described above. 


14.4.3 Position Vectors 


Given any point P(x, y, z), a position vector p is created by assuming that P is 
the vector’s head and the origin is its tail. As the tail coordinates are (0, 0, 0) the 
vector's components аге x, y, <. Consequently, the vector's magnitude ||p|| equals 


V2? +y? +z. 


14.4.4 Unit Vectors 


By definition, a unit vector has a magnitude of 1. A simple example is i, where: 
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і= 1 0 0], where Ш =1 


Unit vectors are extremely useful in the product of two vectors, where their magni- 
tudes are required; and if these are unit vectors, the computation is greatly simplified. 

Converting a vector into a unit form is called normalising, and is achieved by 
dividing its components by the vector’s magnitude. To formalise this process, con- 


sider а vector r = [x y z]l, with magnitude ||г|| = yx? + y? + z2. The unit form 
of r is given by: 
x 1 T 
Е y z] 
lr ll 


This is confirmed by showing that the magnitude of f is 1: 


ee (=) +) +) 


= ER ху? + 2 
Ік| 


#1 = 1 


14.4.5 Cartesian Vectors 


A Cartesian vector is constructed from three unit basis vectors: i, j and k, aligned 
with the x-, y- and z-axis, respectively: 


i-[1 0 OT, і-101 О), k=[0 0 IIT 


Therefore, any vector aligned with the x-, y- or z-axis is a scalar multiple of the 
associated unit vector. For example, 10i is aligned with the x-axis, with a magnitude 
of 10. 20k is aligned with the z-axis, with a magnitude of 20. By employing the rules 
of vector addition and subtraction, we can compose a vector r by summing three 
scaled Cartesian unit vectors as follows: 


г — ai bj+ck 


which is equivalent to: 
г= [а b c]! 


where the magnitude of r is: 


|| = Va? + b + с? 
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Any pair of Cartesian vectors, such as r and s, can be combined as follows: 


r = ai + bj + ck 
s=di+ej+ fk 
г +5 = (а +4)1+ (btejj+(c+ /)К 


14.4.6 Products 


The product of two scalars is very familiar: for example, 6 x 7 or 7 x 6 = 42. We 
often visualise this operation as a rectangular area, where 6 and 7 are the dimensions 
of a rectangle's sides, and 42 is the area. However, a vector’s qualities are its length 
and orientation, which means that any product must include them in any calculation. 
The length is easily calculated, but we must know the angle between the two vectors as 
this reflects their relative orientation. Although the angle can be incorporated within 
the product in various ways, two particular ways lead to useful results. For example, 
the product of г and s, separated by an angle 0 could be ||r||||s|| cos Ө or |г| |8) sin 8. 
It just so happens that cos@ forces the product to result in a scalar quantity, and 
sin Ө creates a vector. Consequently, there are two products to consider: the scalar 
product, and the vector product which are written as r - s and r x s respectively. 


14.4.7 Scalar Product 


Figure 14.7 shows two vectors r and s that have been drawn, for convenience, with 
their tails touching. Taking $ as the reference vector—which is an arbitrary choice—we 
compute the projection of r on s, which takes into account their relative orientation. 
The length of г on $ is ||r|| соз Ө. We can now multiply the magnitude of $ by the 
projected length of г: ||s||||r|| cos @ This scalar product is written: 


Fig. 14.7 The projection of у, 
rons 
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r- s = |11111 сов0 (14.1) 


Because of the dot symbol ‘-’, the scalar product is also called the dot product. 
Fortunately, everything is in place to perform this task. To begin with, we define 
two Cartesian vectors r and s, and proceed to multiply them together using (14.1): 


r — ai c bj + ck 
s=di+ej+ fk 
r -s = (ai + bj + ck) - (di+ ej + fk) 
= ai- (di+ ej+ fk) 
+ bj: (di+ ej + fk) 
+ck- (41+ еј + fk) 
= adi-i+aei-jt+afi-k 
+ bdj-i+ bej-j+bfj-k 
+ cdk-i+cek-j+cfk-k 


Before we proceed any further, we can see that we have created various dot product 
terms such as i-i, i- j, i- К, etc. These terms can be divided into two groups: those 
that reference the same basis vector, and those that reference different basis vectors. 

Using the definition of the dot product (14.1), terms such asi-i, j- j andk- k = 
1, because the angle between i and i, j and j, ог К and К, is 0°; and cos 0° = 1. But as 
the other vector combinations are separated by 90°, and cos 90° = 0, all remaining 
terms collapse to zero, and we are left with: 


r-s=adi-i+ bej-jt+cfk-k 
But as the magnitude of a unit vector is 1, we can write: 
r- s = |lr|||/s|| cos = ad + Бе + cf 
which confirms that the dot product is indeed a scalar quantity. 
It is worth pointing out that the angle returned by the dot product ranges between 


0° and 180°. This is because, as the angle between two vectors increases beyond 
180° the returned angle Ө is always the smallest angle associated with the geometry. 


14.4.8 Тһе Vector Product 


As mentioned above, the vector product r x s creates a third vector whose magni- 
tude equals |г| ||8| sin 9, where Ө is the angle between the original vectors. Figure 
14.8 reminds us that the area of a parallelogram formed Бу г and s equals ||r || ||s]| sin Ө. 
Because of the cross symbol ‘x’, the vector product is also called the cross product: 
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Fig. 14.8 The area of the r 
parallelogram formed by r 
and s 


Fig. 14.9 The vector 
product 


іші. (14.2) 
ІҢІ = 1111811 sine 


We will discover that the vector t is normal (90°) to the plane containing the 
vectors r and s, as shown in Fig. 14.9, which makes it an ideal way of computing 
the vector normal to a surface. Once again, let’s define two vectors and this time 
multiply them together using (14.2): 


r — ai t bj + ck 
s=di+ej+ fk 
r x s = (ai + bj + ck) x (di + ej + fk) 
= ai x (di + ej + fk) 
+ bj x (di+ej+ fk) 
+ck x (di+ еј + fk) 
= adi x ic aei x j+afixk 
+ Ба] x i+ Бе] xj+bdfjxk 
+ сак x i + сек x j+cfkxk 


As we found with the dot product, there are two groups of vector terms: those that 
reference the same basis vector, and those that reference different basis vectors. 

Using the definition for the cross product (14.2), operations such as i x i, j x j 
and К х К result in a vector whose magnitude is 0. This is because the angle between 
the vectors is 0°, and sin 0° = 0. Consequently these terms disappear and we are left 
with: 
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rxs=aei xj+afixk+bdjxi+bfjxk+cdkxi+cekxj (14.3) 


Sir William Rowan Hamilton struggled for many years when working on quater- 
nions to resolve the meaning of a similar result. At the time, he was not using vectors, 
as they had yet to be defined, but the imaginary terms i, j and k. Hamilton's prob- 
lem was to resolve the products ij, jk, ki and their opposites ji, kj and ik. What 
did the products mean? He reasoned that ij = К, jk =i and ki = j, but could not 
resolve their opposites. One day in 1843, when he was out walking, thinking about 
this problem, he thought the impossible: ij = К, but ji = —k, jk =i, but kj = —i, 
and ki = j, but ik = — j. To his surprise, this worked, but it contradicted the com- 
mutative multiplication law of scalars where 6 x 7 — 7 x 6. We now accept that the 
commutative multiplication law is there to be broken! 

Let's continue with Hamilton's rules and reduce the cross product terms of (14.3) 
to: 

r x $ = аек — afj — Бак + bfi + cdj — cei (14.4) 


Equation (14.4) can be tidied up to bring like terms together: 
r x s = (bf — се)і + (cd — af )j + (ae — bd)k (14.5) 
Now let’s repeat the original vector equations to see how (14.5) is computed: 


r = ai + bj + ck 
s=di+ej+ fk 
r x $ = (bf — сей + (cd — af )j + (ae — bd)k (14.6) 


To compute the i scalar term we consider the scalars associated with the other 
two unit vectors, i.e. b, c, e, and f, and cross-multiply and subtract them to form 
(bf — ce). 

To compute the j scalar term we consider the scalars associated with the other 
two unit vectors, i.e. a, c, d, and f, and cross-multiply and subtract them to form 
(cd - af). 

To compute the k scalar term we consider the scalars associated with the other 
two unit vectors, i.e. a, b, d, and e, and cross-multiply and subtract them to form 
(ae — bd). 

The middle operation seems out of step with the other two, but in fact it pre- 
serves a cyclic symmetry often found in mathematics. Nevertheless, some authors 
reverse the sign of the j scalar term and cross-multiply and subtract the terms to 
produce — (af — cd) which maintains a visual pattern for remembering the cross- 
multiplication. Equation (14.6) now becomes: 


r x s = (bf —ce)i— (af — cd)j + (ae — bd)k (14.7) 
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However, we now have to remember to introduce a negative sign for the j scalar term! 
We can write (14.7) using determinants as follows: 


b cl. „РА ОЬ 
г = — 
ег“ ага 
ог 
"е ы e o 
rxs= i 
ELE |а 


Therefore, to derive the cross product of two vectors we first write the vectors in the 
correct sequence. Remembering thatr x s does not equals x r. Second, we compute 
the three scalar terms and form the resultant vector, which is perpendicular to the 
plane containing the original vectors. 

So far, we have assumed that: 


rxs=t 
ІҢ = 1121111511 sin Ө 


where 0 is the angle between г and s, and t is perpendicular to the plane containing 
r and s. Now let's prove that this is the case: 


г. $ = |01151 сов0 = х,х, + у, Уз + 22; 
(х.х, + yrys + 2,2,)2 

ole 
ШЕНЕ 

ІШІ? = 151121912 sin? 6 


= [|г| 7517 (1 — cos? 6) 


cos? 0 = 


2 2 (Xe Xs F Yr Ys F 2,2)? 
= [и (1 ETT 
11211211811 
= [|2 1$? — (хх, + угу» + zz? 


= (x7 + yp +) (х + у; +) — Certs Yes t nn 
=a (у; +) vir +) +, (ху + у;) 

— 2х,х,у,У» — 2XpXsZrZs — 2yr YsZrZs 
xd que + уух; + у, + Хх; ay 

— 2х,х5у, ys — 2Х„Х;®,%@; - 2yrYszrzs 


= (yr Zs - ту, + (2,х, - жу + (Xy. ys = ла 


which in determinant form is: 
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2 
Е? = Yr Zr Zr Xr Xr Yr 
Ys Zs Zs Xs Xs Ys 
and confirms that t is the vector: 
ta 10 |у [ee % % У | к 
У; Zs <; Xs Xs У; 


АП that remains is to prove that t is orthogonal (perpendicular) to г and $, which is 
achieved by showing thatr-t=s-t = 0: 


r—xi-cyjdzk 
s = xsi + yj + zk 
t = (YrZs — 2,55) + (ZrXs — х2) + (Xr Ys — YrXs)K 
r- t = X, (YrZs — ZrYs) + Yr(ZrXs — XrZs) + Zr (Xr уз — YrXs) 
= Xr YrZs — Xr YsZr + XsYrZr — XrYrZs + XrYsZr — XsYrZr = 0 
S- t = Xs (YrZs — Zr Ys) + Ys(ZrXs — XrZs) + Zs (Xr ys — WrXs) 
= Хуу; — Xs YsZr + XsYsZr — Xr YsZs + Xr YsZs — XsYrZs =O 


апа we have proved that r x s = t, where ||t|| = ||r||||s|| sin Ө and t is orthogonal to 
the plane containing r and s. 

Let’s now consider two vectors r and s and compute the normal vector t. The 
vectors are chosen so that we can anticipate approximately the answer. For the sake 
of clarity, the vector equations include the scalar multipliers 0 and 1. Normally, these 
are omitted. Figure 14.10 shows the vectors r and s and the normal vector t, and 
Table 14.2 contains the coordinates of the vertices forming the two vectors which 
confirms what we expected from Fig. 14.10: 


Fig. 14.10 Vector t is 
normal to the vectors r and s 
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Table 14.2 Coordinates of the vertices used in Fig. 14.10 


Vertex 


я & 
Pi 0 0 1 
P» П 0 19 
Рз 0 10 


г = [(х3 — x2) (з-») (z3—22)]' 
в=[(ху—х) (и у) z z)" 


P, = (0, 0, 1) 
P = (1, 0, 0) 
Р; = (0, 1, 0) 


r = —li + lj + Ok 
s = —li + 0j + 1k 
rxs=[1x1-0~x Oli 
—[—1х1—(—1) x 0]j 
[7-1 x0 — (-1) x ПК 
t=i+j+k 


Now let’s reverse the vectors to illustrate the importance of vector sequence: 


s = —li + 0j + 1k 
r= —li + 1j+ Ok 
sxr-[0x0—1 x 111 
-[-130-(-1)x ij 
+ [-1х1- (2D) x O]k 
і--і-і-К 


which is іп the opposite direction to r x s and confirms that the vector product is 
non-commutative. 


14.4.9 The Right-Hand Rule 


The right-hand rule is an aide mémoire for working out the orientation of the cross 
product vector. Given the operation r x s, if the right-hand thumb is aligned with r, 
the first finger with s, and the middle finger points in the direction of t. However, we 
must remember that this only holds in 3D. In 4D and above, it makes no sense. 
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14.5 Deriving а Unit Normal Vector for a Triangle 


Figure 14.11 shows a triangle with vertices defined in an anticlockwise sequence 
from its visible side. This is the side from which we want the surface normal to 
point. Using the following information we will compute the surface normal using 
the cross product and then convert it to a unit normal vector. 

Create vector г between Р; and Ру, and vector s between Р» and P»: 


r = —li + 1j + 0k 
s = —li + 0j + 2k 
гхв=(1х2—0х0)1 
(—1х2—0х —1)j 
+(—1х0—1х—1)К 
t = 21+ 2j+ 1k 
It] = /22+2?° +1? =V5 


— 274,245 2 
i, = 21+ 23+ К 


The unit vector Ê, can now be used for illumination calculations in computer graphics, 
and as it has unit length, dot product calculations are simplified. 


14.6 Surface Areas 


Figure 14.12 shows two vectors г апа s, where the height Л = ||s|| sin Ө. Therefore 
the surface area of the associated parallelogram is: 


area = ||r|| Л = ||r||||s|| sind 
Fig. 14.11 The normal 


vector t is derived from the 
cross product r x s 


>< 
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Fig. 14.12 The area of the 
parallelogram formed by two 
vectors r and s 


Z 


But this is the magnitude of the cross product vector t. Thus when we calculate 
r x S, the length of the normal vector t equals the area of the parallelogram formed 
by r and s; which means that the triangle formed by halving the parallelogram is half 
the area: 


area of parallelogram = ||t || 


А 1 
area of triangle = > ||] 


This makes it relatively easy to calculate the surface area of an object constructed 
from triangles or parallelograms. In the case of a triangulated surface, we simply 
sum the magnitudes of the normals and halve the result. 


14.6.1 Calculating 2D Areas 


Figure 14.13 shows a triangle with vertices Po(xo, yo), Pi (x1, уу) and Р,(хҙ, y2) 
formed in an anticlockwise sequence. The vectors r and s are computed as follows: 


r= (x1 — хо)і+ (yı — yo)j 
$ = (x2 — х0)і + (у2 — yo)j 
lr x s|| = (ху — X0)(y2 уо) — (x2 — хо) (у — yo) 
= X1(¥2 — yo) — хо(у2 — yo) — X21 — yo) + хол — yo) 
= хуу — X1 Yo — Xoy2 + xoyo — Хоу + X2yo + Хоу — Xoyo 
= хуу — X1yo — Xoy2 — Хәу + X2yo + хоу 
= (xoyi — Xiyo) + (x1y2 — xay1) + Gayo — Хоу2) 


But the area of the triangle formed by the three vertices is ilr x s||. Therefore: 


area = 3[(xoyi — xiyo) + Gi yo — xay) + (ху — x0y2)] 


which is the formula disclosed in Chap. 11! 


14.8 Worked Examples 255 


Fig. 14.13 The area of фе 
triangle formed by the 
vectors r and s 


>< 


147 Summary 


Vectors аге extremely useful and relatively easy to use. They аге vital to rendering 
algorithms and shaders, and most of the time we only need to use the scalar and cross 
products. 

Even if you already knew something about vectors, I hope that this chapter has 
introduced some new ideas and illustrated the role vectors play in computer science. 


14.8 Worked Examples 


14.8.1 Position Vector 


Calculate the magnitude of the position vector p, for the point P(4, 5, 6). 
Solution 


р = [4 5 6]', therefore, |р| = V4 + 52 + 62 © 8.77 


14.8.2 Unit Vector 


Convert r to a unit vector. 


Solution 
г= [1 2 3]! 
Iri = V 12 22 + 32 = v14 


^. | T x T 
#= all 2 317 ^ [0.267 0.535 0.802] 
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14.8.3 Vector Magnitude 


Compute the magnitude of r + s. 
Solution 


$ = 5i + 6j + 7k 
r+s=7i+9j+ ИК 


Ir +5|| = /72 +92 + 112 ~ 15.84 


14.8.4 Angle Between Two Vectors 


Find the angle between r and s. 


Solution 
г= [2 0 4" 
s=[5 6 10! 
151 = V22 + 02 + 42 ~ 4.472 
15| = V 52 + 62 + 102 © 12.689 
Therefore: 


ІГІЦ|8| cos = 2х5 +0 х6+4х 10 = 50 
12.689 x 4.472 х cos0 = 50 
50 


COS Ü = —————— % 0.8811 
12.689 х 4.472 


6 = arccos 0.8811 ~ 28.22? 


The angle between the two vectors is approximately 28.22°. 


14.8.5 Vector Product 


To show that the vector product works with the unit vectors i, j and k. 
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Solution 


s = 01+ 1j + Ok 


and then compute (14.7): 
rxs—(0x0—0xDi—(1x0—0x0)j4(1x1—0»x0)k 

The i scalar and j scalar terms are both zero, but the k scalar term is 1, which makes 

i x j = К. 


Let's see what happens when we reverse the vectors. This time we start with: 


r= 01+ 1) + Ok 
s = li + 0j + Ok 


and then compute (14.7): 


rxs-—(1lx0—0x0) —(0x0—0x 1)j+(0x0-— 1x Dk 


The 1 scalar and j scalar terms are both zero, but the К scalar term is — 1, which makes 
j x i= —k.So we see that the vector product is antisymmetric, i.e. there is a sign 
reversal when the vectors are reversed. Similarly, it can be shown that: 


jxk=i 
kxi-j 
kxj--i 
ixk=-j 
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Chapter 15 R) 
Equations еы 


15.11 Introduction 


This chapter is about equations and covers linear, quadratic and cubic equations, 
and concludes with some worked examples. 


15.2 Linear Equations 


The simplest linear equation is: 
y=mx+b 


where: 
e y is the dependent variable, and is a function of x. 
e m is the slope of the line. 
e x is the independent variable of the function y = f(x). 
e b is the line's intercept with the y-axis. 
Examples are: 


yx 
y=5x 
y=2x+3 
У--3х-2 


Тһе main problem with this equation is that it cannot define a vertical line. A form 
that copes with all lines, has a simple vectorial description as follows. 
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Fig. 15.1 A definition of a 
line 


(т, y) 


У-|- y-— yo]" 


Ву 


With reference to Fig. 15.1: 

e n = [а b]l is anormal vector to the line. 

e (xo, yo) is a known point on the line. 

e (x, y) is another point on the line. 

e у= [х — xo у — yo]! is a vector from (xo, yo) to (x, y). 
The dot product п. v = cos 90? = 0: 


n-v=0 

a(x — xo) + b(y — yo) = 0 
ax + by — (ахо + Бу) = 0 
ах +by+d=0 


where d = —(ахо + byo). 
As an example, let = [1 21 and (xo, yo) = (1, 3), therefore: 


a=1 
b=2 
хо = 1 
yo = 3 
d = —(ахо + byo) 
--(1--6) 
--7 
ax+by+d=0 
x42y—-720 
у= (7- х)/2 


Figure 15.2 shows the line у = (7 — х) /2 with the point (1, 3) indicated and ће 
normal vector. 
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Fig. 15.2 The line 1 
у-(7-ху/2 


The idea generalises to three dimensions where: 
еп = [а b c]l is anormal vector to the plane. 
ө (xo, Yo, zo) is a known point on the plane. 

e (x, y, z) is another point on the plane. 


ev=[x—x% у= yo z-— zo]! is a vector from (xo, уо, zo) to (x, y, 2). 
The dot product п. у = cos 90° = 0: 


n-v=0 

a(x = хо) + b(y — yo) + c(z — zo) = 0 
ах + by + cz — (axo + byo + czo) = 0 
ах + by+cz+d=0 


where d = — (axo + byo + czo). 
As an example, letn = [1 1 ИТ and (xo, yo, zo) = (1, 2, 3), therefore: 


а= 1 

р= 1 

с = 1 

xo = 1 

yo = 2 

50 = 3 

d = – (ахо + Буд + czo) 
= —6 


ax+by+cz+d=0 
x+y+z—-6=0 
2-6-х-у 


Figure 15.3 shows the plane z = 6 — x — y with the point (1, 2, 3) indicated апа 
the normal vector drawn twice the length. 
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Fig. 15.3 The plane 
z—6—x-—ywitha 
vertical z-axis 
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15.3 Quadratic Equations 


A quadratic equation takes the form: 


ax? + bx +c=0 


where a = 0 


A quadratic equation is solved by two values of x which are called the solutions or 
roots. These are generally real, but sometimes complex. The solutions are given by: 


—b+ P? — 4ac 
x= —— 


2a 


For example, 2x? + 5x + 3 = 0 is solved as follows: 


2x? +5х+3=0 


a=2 
b=5 
c=3 
-5:--./52-4х2х3 
с 2x2 
—5 +1 
4 
-5+1 
7 =- 


X] = 
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-5-1 
— = 


—1.5 


X2 = 


Therefore, the equation is solved by x = —1 and x = —1.5. Let’s substitute these 
values into 2x? + 5х + 3 = 0 and prove that they are correct. 


2x)! +5x+3=0 


х=-1 
2х (-1)2 +5 х (-1I) +3 =2-54+3=0 
x=-1.5 


2 x (1.5) +5 х(—1.5)+3=4.5—7.5+3=0 


therefore, they are correct. 


Figure 15.4 shows a graph of 2x? + 5x + 3 = 0 with the solutions indicated. 
When a quadratic has identical roots, such as 2x? + 2x + 0.5 = 0 where хі = 
хә = —0.5, this is because: 


a=2 
b=2 
c=0.5 
~24 J/2—4x2x05 
= 2х2 
= —0.5 


the radical is zero, i.e. b? = 4ас. 


When a quadratic has complex roots, such as 2x? + 2x + 1 = 0 where хі = 
—0.5 + 0.51 and x; = —0.5 — 0.51, this is because: 


Fig. 15.4 The graph of 
2x? +5x+3=0 


Osx 
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a 
b= 
С 
—2 + 22--4х2 
x= 
2x2 
—2 + 21 


4 
xı = —0.5 + 0.51 
x = —0.5 — 0.51 


The radical is complex, i.e. b? < дас. Let's substitute these values into 2x? + 2x + 
1 = 0 and prove that they are correct: 


2x? 42x +1=0 


x = —0.5 + 0.51 
2 x (—0.5 + 0.5i)(—0.5 + 0.51) + 2 х (—0.5 + 0.51) + 1=0.5-i-05-—1+i+1=0 
x = —0.5 — 0.51 


2 x (—0.5 — 0.51)(-0.5- 0.51) + 2 x (—0.5 — 0.51) + 1 =0.54+1-05-—1-i+1=0 


therefore, they are correct. 


15.4 Cubic Equations 


A cubic equation takes the form: 
ax? + bx*+cx+d=0 


and there are no direct definitions of the solutions to this equation. However, if we 
start with: 
х? +2x? — 8х = 0 


we can factor out x as follows: 


x(x? + 2x — 8) = 0 
x(x + 4)(х — 2) 20 
х= 0 
х = —4 


хз = 2 
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Fig. 15.5 Тһе graph of 
х? 42x? — 8x 20 


Figure 15.5 shows the graph of x? + 2x? — 8x = 0. Although this example has 


been chosen to have simple roots, the quadratic can be solved using x = ev ae ур лас 

For those wishing to discover more about solving cubic equations ake a look 
at (https://www.wikihow.com/Solve-a-Cubic-Equation) and (https://en.wikipedia. 
org/wiki/Cubic_equation). 


15.5 Worked Examples 
15.5.1 Line Equation 


Show that a linear equation of the form ax + by + c = 0 can indicate the vertical 
line x = 3. 


Solution 
The vertical line x = 3 is defined as follows: 

өп = [1 O] isa normal vector to the line. 

e (3, 0) is a point on the line. 

e (x, y) is another point on the line. 

ev=[x—3 у —OJ isa vector from (3, 0) to (x, y). 
The dot product п. у = cos 90° = 0: 


n-v=0 

П O]T.[x -3 у] =х-3+0=0 
х-3= 0 
х= 3 


The equation of the vertical line is x = 3. 
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15.5.2 Plane Equation 


Find the equation of the plane when the normal vector is n = [2 2 3]T and the 
point (xo, yo, zo) = (2, 2, 3) is on the plane. Sketch the plane. 


Solution 

a=2 

b=2 

QE 

X9 =2 

yo =2 

50 =3 

d = —(ахо + byo + czo) 
--(4--4--9) 
= —17 


ax+by+cz+d=0 
2х + 2у + 32 – 17 = 0 

17 — 2х —2y 
= 3 


Figure 15.6 shows the plane z = (17 — 2x — 2y)/3 withthe point (2, 2, 3) indicated 
and the normal vector. 


Fig. 15.6 The plane 
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15.5.3 Quadratic Equation 


Find the roots of x? + 8х + 3 = 0 and prove that they solve the equation. 
Solution 


х2 + 8х +3 = 0 


а= 1 
р = 8 
с= 3 
—8 + 82 4х 1х3 
x= 
2х1 
ВЕ 
Ў 2 
—8+ 7.211 
xp A — = —0.394 
—8 — 7.211 
хә Buc EE = —7.606 


Therefore, the equation is solved by x ~ —0.394 and x ~ —7.606. 


х2 + 8х +3 = 0 
x & —0.394 
(—0.394)? + 8 x (—0.394) + 3 = 0.1552 — 3.152 +3 ~ 0 
x © —7.606 
(—7.606)? + 8 х (—7.606) + 3 = 57.8512 — 60.848 +3 ~ 0 


therefore, they are correct. 


15.5.4 Cubic Equation 


Solve the cubic equation x? + 3x? — 18x = 0, and show that they solve the equation. 


Solution 
We can factor out x as follows: 


x(x? + 3x — 18) 20 
x(x + 6)(х — 3) =0 
x=0 
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0 -3x0—18x020 


x =—6 
(—6)? + 3 x (—6)? — 18 x (—6) = —216 + 108 + 108 = 0 
х= 3 


3 +3 х 3? — 18 х 3 = 27 + 27 – 54 = 0 


The cubic equation has roots x = 0, х = —6 and x = 3. 
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Chapter 16 A) 
Complex Numbers geun 


16.1 Introduction 


This chapter reviews complex numbers, where the complex plane is described as 
a way of visualising complex numbers and various algebraic operations, and two 
functions for isolating the real and imaginary parts of a complex expression. The 
section on Complex Algebra examines topics such as the complex conjugate, powers 
of i, complex exponentials, logarithms of a complex number, and the hyperbolic 
functions. Finally, there are a dozen worked examples. 

Readers interested in the history of complex numbers are invited to read the 
author’s book Imaginary Mathematics for Computer Science (Vince 2018) where 
they will discover the problems associated with the choice of the words imaginary 
and complex. For this chapter, the reader should forget the every-day meaning of 
imaginary, and regard i = ./—1 as a mechanism for dividing a mathematical expres- 
sion into two parts: a real part, and a part associated with i. 


16.2 Representing Complex Numbers 


This section explores various ways of representing complex numbers numerically 
and graphically. 


16.2.1 Complex Numbers 


Numbers such as a + bi form the set of complex numbers C, where: 
a,beR, Р=-1 
Examples are 2, 23— 12i, 3+ x?i. 
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16.2.2 Real and Imaginary Parts 


The real and imaginary parts of a complex number z are isolated by the Re and 
Im functions. For example: 


-а-ы 
a = Re(z) 
b = Im(z) 


These two functions permit one to construct formal algebraic definitions such as 
defining one complex number being equal to another. In words, one would say ‘two 
complex numbers are equal iff they have identical real and imaginary parts.’ e.g. 
given zı = ху + yıl and 52 = x? + yoi, then zı = 22, iff xı = x2 and у! = y2. Using 
Re and Im, we can write: 


1-22 <> [Re(z,) = Re(z2) A Im(zi) = Im(z2)] 


16.2.3 Тһе Complex Plane 


When the real number line is combined with a vertical imaginary axis, it creates the 
complex plane, as shown in Fig. 16.1, where any complex number has a unique 
position. Figure 16.2 shows the positions of the following four complex numbers: 


Р-4--3, О--3--2, R=-3-31, 8-4-б5і 


Fig. 16.1 Тһе complex 
plane 


16.3 Complex Algebra 


Fig. 16.2 Four complex 
numbers 


Imaginary 
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Real 


This section reviews the axioms associated with complex algebra, the complex con- 
jugate, complex division, powers of i, the rotational properties of i, polar notation, the 
complex norm, the complex inverse, complex exponentials, the roots and logarithms 
of a complex number, hyperbolic functions, and the role of the complex plane in 


visualising complex functions. 


16.3.1 Algebraic Axioms 


Complex numbers obey the axioms associated with real numbers. But for clarity, 


examples are included to show how imaginary terms are resolved. 


Given: 


zi =x + уп 
z2 = № + у! 
53 = X3 + узі 


The commutative law of addition: 


21+ 52 = 2 +41 = (x1 + хә) + (у + yi 


e.g. (2+ 31) + (4+ 51) = 6+ 8i 
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The commutative law of multiplication: 


2122 = 2221 
e.g. (2+ 31) (4+ 5i) = 8 + 10i + 121+ 1512 
--7--221 


The associative law of addition: 


21 + (52 + 33) = (z1 + 22) + z3 = Z1 + 52 + 23 
e.g. (2 + 31) + (4+ 51) + (6 - 7i) = 12+ 15i 


The associative law of multiplication: 


210223) = (2122723 = 212223 
e.g. (2 + 31)(4 + 5i)(6 + 7i) = (8 + 221 + 1512)(6 + 71) 
= (—7 + 221)(6 + 71) 
= —42 + 132i — 491+ 1542 
= —196 + 831 


The distributive law of multiplication: 


zi(za + 23) = ziza + 2123 
e.g. (2 + 31) [(4 + 51) + (6+ 7i)] = (2 + 31)(10 + 121) 
= 20 + 30i + 241 + 36i? 
= —16 + 54i 


From the above, one can see that the addition of complex numbers is identical to 


the addition of vectors. Figure 16.3 illustrates the addition of zı = —2 + 3i and 
22 = 3+1. 


16.3.2 Complex Conjugate 


The complex conjugate is a useful algebraic construct and is denoted by 2 or z*. To 
avoid confusion, I will always use 2. 
Given z = a + bi, then z = a — bi, which can be expressed as: 


z = Re(z) — Im(z)i 


The product zz is extremely useful, as it is a real quantity. Generally: 
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Fig. 16.3 The addition of 
51 + 52 


21 ЛГ fis 


z=atbi 
z=a-—bi 
zz = (a+ bi)(a — bi) 


= а? — abi + abi — Ь212 


=a? +b? 
which is real. For example: 

z=3+4i 

7-3-ді 

zz = 25 
Figure 16.4 shows z and 2. 
Let's prove that 2122 = 2122. 
zı =a; + bii 


52 = ал + bj 
3122 = (ai + bii) (a + 201) 
= а1(а + boi) + bii(a + boi) 
= aı (m — 621) — bii(a - boi) 
= (а — bii)(a» — bai) 
= 2122 
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Fig. 16.4 A complex 
number and its complex 
conjugate 


16.3.3 Complex Division 


The complex conjugate is useful in resolving complex division; for if we multiply 
the numerator and the denominator by the complex conjugate of the denominator, 
the denominator becomes a real quantity. For example, we evaluate this quotient as 
follows: 

_ pom 

142i 

(10 + 5i) (1 — 2i) 
© (42i) (1-21) 
_ 00+ 5)d — 21) 


5 
= (2+i)(1 — 21) 
=2-4i1+i- 21 
= 4 — 31 
16.3.4 Powers of i 
As i? = —1, let's consider other powers of i. For example: 


і = 1212 = 1 


апа 


Table 16.1 shows the sequence up to Г. 
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Table 16.1 Increasing powers of i 


10 ài 10. i 


1 i zil zu 1 i =! 


This cyclic pattern is quite striking, and reminds one of: 
X, у, =X, =Y, Х,... 


that arises when rotating around the Cartesian axes in an anticlockwise direction. 
The above sequence is summarised as: 


where n є № 
--і 


But what about negative powers? Well they, too, are also possible. Consider i^! , 
which is evaluated as follows: 


сн CUN. р 
1 1(—1) 1 
Similarly: 
pt M Жз 1 
а= 
апа 
i? = i77? = —i(—1) =i 


Table 16.2 shows the sequence down to i$. This time the cyclic pattern is reversed 
and is similar to: 
X, =Y, —X, y, Х,... 


that arises when rotating around the Cartesian axes in a clockwise direction. 


Table 16.2 Decreasing powers of i 


10 gel qe 


1 zi =i i 1 =i =! 


3 1—4 = 
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16.3.5 Rotational Qualities of i 


Now let's investigate how a real number behaves when it is repeatedly multiplied by 
1. Starting with the number 3, we have: 


1х 3 = 31 

іх 3i= —3 
іх (—3) = 3i 
іх (-3)i=3 


The cycle is 3, 31, —3, —3i, 3, 31, —3, —3i, 3, ... with four steps, as shown іп 
Fig. 16.5. 

If we multiply a complex number by i, it is also rotated 90°. For example, the 
complex number Р = 4 + 31 in Fig. 16.6 is rotated 90° to О by multiplying it by i: 


i(4 + 3i) = 4i + 312 


=4—3 
=—3+4& 


The point О = —3 + 41 is rotated 90° to R by multiplying it by 1: 


i(—3 + 4i) = —3i + 412 
=—3i-4 
=—4-— 3i 


The point R = —4 — 3i is rotated 90° to 5 by multiplying it by 1: 
Fig. 16.5 Тһе cycle of 


points created by repeatedly 
multiplying 3 by i 
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Fig. 16.6 Multiplying a 
complex number by i 


i(—4 — 31) = —4i — 312 
—4i 4-3 
—3-— 4i 


Finally, the point 5 = 3 — 4i is rotated 90° back to P by multiplying it by i: 


iG — 41) = 31 — 47° 
=3i+4 
=4+31 


As you can see, complex numbers are intimately related to Cartesian coordinates, 
in that the ordered pair (x, y) = a + bi. 


16.3.6 Modulus and Argument 


As a complex number has a unique position on the complex plane, and is always 
relative to the origin of the real and imaginary axes, it can be visualised as a position 
vector and assigned a modulus or magnitude, which is the distance r of the com- 
plex point to the origin; consequently, z = а + bi has a modulus ғ = Ма? + b? and 
notated as |z| = Ма? + 2. This can also be expressed as: 


Iz? = a? + b? = [Re(z)P + Пиц)? 


Here are some useful relationships: 
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z= (a + bi(a— bi) 2a? + b? 


22- |212 
|2] = [| 
[= z| = |z| 


|2122] = 1211221 


2 2i 12 
|2122]7 = |41] 1 


Pursuing the similarity between complex numbers and position vectors, the 
straight line from the origin to a complex number z = a + bi, subtends with the 
real axis an angle 6, called the argument; consequently, 0 = (ап! (b/a), and is 
notated as arg(z) = tan™! (b/a). Figure 16.7 shows the complex number z = 3 + 41 
with a modulus г = 4/32 + 4? = 5 and an argument 0 = tan! (4/3) = 53.125°. 

From Fig. 16.7 we can generalise that a complex number z = a + bi has real and 
imaginary components: 


a —rcosÓ 


b-—rsinO 


which permits us to state: 
z —rcos0 + ir sin 


and when г = 1: 
z = cos + isin 


For example, given two complex numbers: 
Fig. 16.7 The argument 0 


and modulus r of a complex 
number 5і 


Imaginary 


41 


rsin@ 


-5 4 32 -1 0 п 2 3 4 5 | Real 
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21 = а +В 
52 = аз + boi 
where: 
a, = rı cos 01, bj = ri sin 0; 
dy = r3 COS Ө), by = р sin 05 
therefore: 


2122 = (a) + bii)(a» + Doi) 
= (ауаз — о) + (ay bz + bya2)i 
= (rı cos 0 · r2 COS Ө — г sin Өү. r2 sin 0) 


+ i(rı cos 01 - r2 sin Ө) + ri sin 0] · r2 cos Ө») 
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(16.1) 


= 1172 (cos 01 cos 05 — sin 0] sin 05) + irjr2(cos 0; sin Ө» + sin 0; cos Ө») 


= ғғ СО5(бі + 62) + irır2 sin(0; + 02) 


(16.2) 


which shows that to compute the product of two complex numbers, we multiply their 
moduli and add their arguments. Let's illustrate this operation with an example. We'll 


start by computing the product using (16.1). 
Given: 


1 V3. 
1=5+ 1 
ЖЕСТ 
=e ay 
ЛАМ 3% Мз v3 
an 5 ( 2) 2 2 t 2 F 
1 3 (+ УЗ). 
к= + 1 
4 4 4 4 
= –1 


Next using (16.2). But first, we need to compute ће moduli and arguments: 
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2122 = cos(60° + 120°) + isin(60° + 120°) 


Naturally, the results are the same. 


16.3.7 Complex Norm 


The term norm causes a lot of confusion, simply because there are so many, and 
each one requires a precise definition. For our purposes, norms are associated with 
vector spaces, where the norm of a vector is a function that returns some numerical 
property of the vector. The Euclidean norm of vector v, is generally written: 


Ivi = УУ-У 


which is the square-root of the inner product of the vector with itself. For example, 
if the vector у = [3 4], then ||v|| = 4/3? + 42 = 5, and represents the Euclidean 
length of the vector. 

The absolute value of a signed number +x is written |x|. For example, if x = 
+23, |x| = 23, and if x = —23, |x| = 23. The absolute-value norm ||x||, equals 
the absolute value, i.e. ||x|| = |x]. 

The complex norm or Euclidean norm of a complex number z = a + biis given 


by: 
ІСІ = Iz] = va? + b? 


which is the modulus of z. 

The modulus or Euclidean norm of a complex number measures an abstract dis- 
tance corresponding to the length of the complex point to the origin on the complex 
plane, and is normally expressed: 


llzll = М = v(a + bia = bi) = Va? + b? 


Let’s prove that the Euclidean norm of the product of two complex numbers, equals 
the product of the individual Euclidean norms. 
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zı = а + bii = r1, Ө, 
52 = а + boi = r2, 6; 


[212211 = Iziz2l 
= |411: 10| 
= zi: [521] 


16.3.8 Complex Inverse 
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We have already seen that to divide a complex number x by another z, we multiply 
the numerator and denominator by the conjugate of the denominator: 


which can be written as: 


thus the complex inverse is: 


X a+bi с-а 
2 c+di c—di 
xz x < 

Ин 
E z 
ZW 

|212 


For example, the inverse of 3 + 41 is: 


6+4 = 20-4) 


16.3.9 Complex Exponentials 


In order to describe complex exponentials we require three power series. We start 
with the power series for е?, sin Ө and cos Ө: 


e?—1 
5100 = 0 
cos = 1 


o0! g9 ө 04 65 06 607 0% g 
Tar cw ha л ee ө л BOF 

ө 05 ө ө 

и л ө 

0? 64 06 08 

жїл Ө 
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Fig. 16.8 The sine power 
series for different number of 
terms 
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These series explain why the radian of angular measurement is used, for when 0 = л, 


sin 0 = 0, and соѕ0 = — 
5, 7 and 9 terms. 


1. Figure 16.8 shows curves of the sine power series for 3, 


Euler discovered that by making 0 imaginary: е, we have: 


и i0! 5 ug 10% Ө? 0% d^ 6% 16? 
SES r oe a л з GO sat 
02-04 06 68 i0! 103 105 107 10° 
ТТТ 9! 
- 0? 64 06 08 (0 0 6 өт g 
T ata a sa i HA a a тін тала 


= cos + isin 


which is Euler’s trigonometric formula. If we now reverse the sign of 10 to —10, 


we have: 
а 10! 02 103 0 10° 06 107 08 i0? 
= п 2 ae al 5! ^w 917 
E o ot o6 o? i9! 10° 10° 107 10 
~ палета и 9 
В. (9 ө ө ө 0? 
T 3 Cum ta Anas лт ө 
= cos — ising 
thus we have: 
e? = cos -FisinO 
e ? = соѕ0 — іѕіпӨ 


from which we obtain: 
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ей + gió 
cos Ü = ————— 

2 
10 —10 

. e — e 
5100 = - 
21 
Given е = соз Ө + isin, when Ө = л, we have e" = —1, or e" + 1 = 0, which 


is Euler’s famous equation. The American physicist Richard Feynman (1918-1988), 
referred to the equation as ‘our jewel’ and ‘the most remarkable formula in mathe- 
matics.’ (Feynman 1977) 

Another strange formula emerges as follows: 


cosÓ + ising = e? 


л ‚2. үл ; 
cos (5) + isin (5) LI 
і = e? 
oo (e?) 
— е?л/? 
— ө-л/2 


i! = 0.207879576... 


which reveals that i! is a real number, even though i is not a number, as we know it! 

Geometrically, e? is a point on the unit circle, on the complex plane. Consequently, 
re? is another point, radius r from the origin, with real and imaginary coordinates 
x =rcos@ and у = г sin Ө, respectively, as shown in Fig. 16.9. This is the polar 
form of a complex number. 


Fig. 16.9 The unit circle 
and rei? 
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Let’s return to the product of two complex numbers, and see how the product can 
be visualised using polar notation. 
Equation (16.2) shows that: 


zı = ғ (cos 6, + isin 91) 
z2 = "(сов Ө» + isin 62) 
2122 = rjr» (соѕ(0 + 62) +1sin(O; + 05)) 


Using polar notation: 
Z1 = rye! 


22 = re 


2122 = пейте? 


= r rael +t) 
Now let's show three ways of computing the product of the following three com- 
plex numbers shown in Fig. 16.10: 


uu d 

2=1+V3i 
42. wa. 

а= >= =] 
2 2 


Fig. 16.10 Three complex 
numbers 
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21: the argument is 30° and a modulus of 3. 
Z2: the argument is 60° and a modulus of 2. 
23: the argument is 225° and a modulus of 1. 
Let’s compute the product 212223: 


212223 = (52 + 3 (1 + Уз) (-% - x 


(52 9 3, | /2 75) 
= ---1---1 1 


2 22 2 2 2 


2342 — 3/21 


which confirms that the product 212223 rotates any complex number 315°, and scales 


its modulus by 6. 
Now let’s compute the product using cosines and sines: 


zı = 3(cos 30° + isin 30°) 
z2 = 2(cos 60° + isin 60°) 
23 = cos 225° + isin 225? 
212223 = 3(cos 30° + i sin 30?)2(cos 60° + i sin 60°) (cos 225° + isin 225°) 
= 6(соѕ 315° + isin 315°) 


= 3/2 — 3/21 


which is much simpler. Finally, let’s define the complex numbers in polar form, with 
angles in degrees, for clarity: 


z= Зе 30° 
22 = 26/50 
z3 = е!225 
515223 = ЗеіЗ0 26:50 е!225 


ze 
= 6(соѕ 315° + isin 315?) 
= 342 — 3/21 


which is even neater! 
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16.3.10 de Moivre's Theorem 


Euler's trigonometric formula can be developed as follows: 


cos@ +isind =e” 


(cos 0 +isin6)" = (e^)" = e? 

(сов0 + isin)” = сов(п0) + isin(n0) (16.3) 
where (16.3) is known as de Moivre’s theorem, after the French mathematician 
Abraham de Moivre (1667-1754). 

Substituting п = 2 in (16.3) we obtain: 
соѕ(20) + isin(20) = (cos Ө + i sin 0)? 


= cos? 0 — sin? 0 + 12 соз0 sin Ө 


Therefore: 
cos(20) = Re (cos? 0 — sin? 0 + i2cos@ sin 8) 
= cos? 0 — sin? 0 
sin(20) = Im (cos? 0 — sin? 0 + i2 cos sin 0) 


= 2с050 sin д 


de Moivre’s theorem can be used for similar identities by substituting other values 
of n. Let's try n = 3: 
cos(30) + isin(30) = (cos0 + isin 6) 
= (cos0 + 15іп 0) (cos? 0 — sin? 0 + i2 cos 8 sin Ө) 


Therefore: 


cos(30) = Ке [ (cos Ө +isin@) (сов? 0 — sin? 0 + 12 соз 0 sin 9) ] 
= cos? 0 — cos 6 sin? Ө — 2cos 0 sin? 0 
= cos? 0 — 3cos 9 sin? 0 
= cos? ð — 3 соѕ0(1 — сов? 9) 
= 4cos* 0 — 3cos0 
sin(30) = Im [(cos Ө + i sin 0) (cos? Ө — sin? 0 + i2 cos Ө sin 6)] 
= cos? 0 sin 0 — sin? 0 + 2cos? 0 sin@ 
= 3cos? 0 sin — sin? 0 
= 3510 0(1 — sin? 0) — sin? 0 


= 3sin0 — Asin? 0 
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Let’s test these identities with 6 = 30°: 


cos(30) = 4cos? 0 — 3 cos 0 
cos 90? — 4 cos? 30? — 3 cos 30? 


Е PE 

es pes 
3 3 

= Be 


sin(30) = 3віп0- 4 sin? 0 
sin 90° = 3 sin 30? — 4 sin? 30° 


Кк гү 
72 2 
=1 


Given z = cos Ө + isin Ө, we can define cos 0 and sin 0 in terms of 2 as follows: 


z = cos +isind (16.4) 
— e 

1 ; 

=== ег” 

2 
= cos — isinÓ (16.5) 


adding and subtracting (16.4) and (16.5): 
1 
z+- = 2со080 
2 
2- = = 12810 6 
2 
1 1 
соѕ50 = – {2+ – (16.6) 
2 2 
: ‚1 1 
5100 = —i- {= – – (16.7) 
2 2 
Let’s use de Moivre’s formula to show that: 


1 
2" + — = 2сов(п0) 
gh 
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Proof: 


z” = cos(n@) + isin(n@) 
z” = cos(—n0) + isin(—n0) 
cos(n@) — i sin (n0) 
z” +z” = 2с08(п0) 


16.3.11 nth Root of Unity 


16 Complex Numbers 


The real roots of 1 can only be +1, but complex numbers introduce the concept 
that unity possesses an infinite number of complex roots. The complex roots of 1 
satisfy the equation z” = 1, where n is a positive integer. Such roots are employed 
in different branches of mathematics, such as number theory and discrete Fourier 


transforms (https://en.wikipedia.org/wiki/Root_of_unity). 


If the nth root of 1 is z, Шеп 2” = 1. Therefore, using de Moivre’s theorem: 


PY se". k=0,1,2,...,n— 1 


(=) ‚2, (=) 
= cos | — | +1sın | —— 
n n 


For example, when n = 3: 


0 0 
[k = 0] co = 00s (3) вы (5) =1 


+ 
[k = 1] анан ш 
= = Cos | — 1510 
21 3 3 
ЕТ Вы = 
=2] а-с(- isin | 7-J— 


Let’s confirm these results: 
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Fig. 16.11 Three roots of Im 
unity 


ЕСІ 14/3, 
== еке 1122772 из 


These roots are located on the unit-radius complex circle, as shown in Fig. 16.11. 
Connecting the points together creates a regular polygon. 


16.3.12 nth Roots of a Complex Number 


Given: 
z = r(cos0 + isin) 


then: 
6+k2 6+k2 
i= | cos (EET) e isi (277. . 0<k<(n-1) 
n n 


1/4 
For example, let’s find (8 + i83) Я 


To begin, we convert it into polar form: z = 16e: 


z = 16 [cos (5) + isin (5) 


ds ME (5) de (22 t 223] 
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Fig. 16.12 4th roots of Im 


16(сов(л/3) -Ғісіп(л/3)) 


[k=0] w= 2 [cos (=) + isin (5)| = 1.932 + 0.5181 
к= 1] а =2[[е (+ +Æ) eism (5 = 
[К = 2] a= (+) eis (= z 
emal Ерте 


Figure 16.12 shows these roots. 


16.3.13 Logarithm of a Complex Number 


In order to take the natural logarithm of a complex number, we use the exponential 


) А 0.518 — 1.9321 


+ =) = —0.518 + 1.9321 


F =) А —1.932 — 0.5181 


form. Consequently, if we are given a + bi, this must be converted to ге. Therefore, 


given: 
z=re 


then the logarithm of a complex number z equals: 
Inz = Inr +10 


Or 
Inz = In |z| + i arg(z) 
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As exponential functions can have multiple values, the imaginary component is 
restricted to the interval —л < 0 < л. For example, —1 is represented by 
ел eJ ебі etc., but to satisfy the agreed interval constraint, —1 = ет, 


For example, given z = —2 + 21, then: 
—242i= y (—2)2 + 22. gitan ! (2/-2) 
= А/8е10:75т 
In(-2 + 2i) = In (Веб?) 
= In V8 + 0.75л1 


А 1.039721 + 2.356191 
Similarly, given z = 3 — 41, then: 


3 — 4i = 32 + (—4)2. ei tan”! (4/3) 


x Se 10.927295 


In(3 4i) жіп P 1092725) 
& In5 — 0.9272951 
А: 1.609438 — 0.927295i 


Logarithms of other complex numbers are shown in Table 16.3. 


16.3.14 Raising a Complex Number to a Complex Power 


Now that we have seen how to take a logarithm of a complex number, the way is 
open to raise a complex number to a complex power. For example, given: 


z= (16.8) 


then: 
у= 10: (16.9) 


and substituting (16.9) in (16.8), we obtain: 
z= е (16.10) 


raising both sides of (16.10) to some power w: 
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Table 16.3 Logarithms of complex numbers 


1 е form Inz 

1 сі0 0 

=i сіл лі 

і сіл/2 ті 

=i етіл/2 -ті 

5 5:10 1.609438 

-5 зет 1.609438 + лі 
5i seit /2 1.609438 + Fi 
—5i se-in/2 1.609438 — Ті 
5+5 ,/5овіл/4 1.956012 + Ті 
5—5i 50e -in/4 1.956012 — Жї 
5+5 „/50еіЗт/4 1.956012 + Злі 
5-5 50e 137/4 1.956012 — 3f i 
0.5 0.5еі0 0.693147 

—0.5 0.5еїт —0.693147 + лі 
0.5i 0.57 /2 —0.693147 + Fi 
mosi 0.5 іл/2 —0.693147 — Fi 


Complex Numbers 


(16.11) 


Equation 16.11 applies to both real and complex numbers, so first, let’s begin with: 


222 
w=1+1 


which requires raising e to the product of 1 + i and the natural logarithm of 2: 


therefore: 


In2 ~ 0.693147 


(1 + 1)0.693147 = 0.693147 + 0.693141 


е(0.693147--0.6931471) = е0-693147 0.693 147i 


= 2(cos 0.693147 + isin 0.693147) 


~ 2(0.769239 + 0.63896 1) 


& 1.538478 + 1.2779221 


2191 а 1.538478 + 1.277922i 
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Now let’s use: 


1=2 + 21 
и = 1+1 
Шеп: 
z” — 2 $ 2i)! ti 


= e? In(2+2i) 


which requires raising e to the product of 1 + i and the natural logarithm of 2 + 2i. 
Not very nice, but let’s have a go! 


2+2i = /2? + 2267/4 
= J8ei7/4 


In(2 +21) = 1n 2.828427 + т 


А: 1.039721 + 0.7853981 
(1+1) (1.039721 + 0.7853981) = 1.039721 + 1.0397211 + 0.7853981 — 0.785398 


= 0.254323 + 1.825119 
е(0.254323+1.8251191) _ 0.254323 ,1.8251191 


= 1.289588(cos 1.825119 + isin 1.825119) 
А 1.289588(—0.25159 + 0.9678341) 
А —0.324447 + 1.2481071 


therefore: | 
(2 + 21) ғ —0.324447 + 1.2481071 


16.3.15  Visualising Simple Complex Functions 


We are aware of how real functions such as f(x) = 2x? + 3x +5 behave, as it is 
possible to draw a graph relating f (x) to x. But when it comes to complex functions, 
such as f(z) = (а + bi)”, we require two dimensions to represent the original real 
and imaginary terms, and two further dimensions to represent the function, which 
is difficult in our three-dimensional world. However, in order to get a feel for what 
is happening between a complex variable and function, we can plot how individual 
numbers behave when subject to a function. To illustrate this, Fig. 16.13 illustrates 
how nine complex numbers in the first quadrant, behave when they are subject to 
a square function. For example, (1 4- 31)? moves to —8 + 6i, and (3 + 3i)? moves 
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Fig. 16.13 The trajectories Im 
of nine complex numbers in 
the first quadrant, when 
squared 


to 181. The dashed lines show the trajectory as the exponent increases from | to 2. 
Note that the squaring function imposes an anticlockwise rotation on the trajectories, 
with the end complex numbers in the same or second quadrant. The solid blue lines 
connect the transformed points together to emphasise the distortion caused by the 
squaring transformation. 

The functions used to draw the blue lines are: 


Ро) = [0-00 — 0) + A + 30р 

Ро) =[2+)0 — 0) + Q3) 

Ро) -2IG- 00 -A + B+ 3ieP тее 
Ро) =а+)(1—)-+(3-+1)г}° adis 
f(z) = [0 2) — 0 + (3+ 20)р 

Ро) = 01+ 3001 — 0 + (3+ 30] 


Figure 16.14 shows the trajectories for nine similar complex numbers іп the second 
quadrant, where the squaring function imposes an anticlockwise rotation on the 
trajectories, with the end complex numbers in the third or fourth quadrant. 

Figure 16.15 shows the trajectories for nine complex numbers in the third quadrant, 
where the squaring function imposes a clockwise rotation on the trajectories, with 
the end complex numbers in the first or second quadrant. 

Figure 16.16 shows the trajectories for nine complex numbers in the fourth quad- 
rant, where the squaring function imposes a clockwise rotation on the trajectories, 
with the end complex numbers in the third or fourth quadrant. 
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Fig. 16.14 The trajectories 
of nine complex numbers in 
the second quadrant, when 
squared 


Fig. 16.15 The trajectories 
of nine complex numbers in 
the third quadrant, when 
squared 


Im 
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Fig. 16.16 The trajectories Im 
of nine complex numbers in 
the fourth quadrant, when 
squared 


Fig. 16.17 The trajectories Im 
of eight, squared imaginary Ese co ae i 
numbers 


The only remaining numbers to consider are on the real and imaginary axes. 
The real axis is simple, as the square of any real number is another real number. 
Figure 16.17 shows the anticlockwise trajectories of four positive imaginary numbers, 
and the clockwise trajectories of four negative imaginary numbers. 


16.3.16 The Hyperbolic Functions 


The trigonometric functions derive from the geometry of the circle x? + y? = 1, 
whereas the hyperbolic functions are associated with the geometry of the hyperbola 
x? — y? = 1. However, they are all related to e as we will see. 


16.3 Complex Algebra 


Given: | | 
ei? + ете 
соөб------ 
then: 
eiO) 4 gie) 
соѕ(10) = —— ——— —— 
(10) 5 
Е ее + °° 
2 
= cosh 0 
Similarly, given: 
7r е” E ет” 
sind = zi 
then: 
eli? e i? 
sin(10) = 
(9) 21 
e? — еб 
я 
ECL 
2 
= isinh ө 
By definition: 
ТТ sinhO е-е 
ап = = . 
coshO е? +e? 
Therefore: 
Ө E 
cosh 9 = Lt = cos(id) 
e? — e? "V 
sinh Ө = NE 7 = —1511(10) 
ей — e? 
tanh 6 = e) е9 = —i (ап(10) 


cosh Ө + sinh@ = e? 
cosh 0 — sinh 0 = e^? 


cosh? 0 — sinh? 0 = 1 


Figure 16.18 shows how sinh and cosh relate to the hyperbola. 
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Fig. 16.18 The hyperbolic 
functions | 


Ua! 
>< 


cosh Ё 


1 sinh ¢ 


16.4 Summary 


Hopefully, this chapter has established i = 4/—1 as an incredible invention. Even 
though it does not belong to the traditional number systems, it is a valid math- 
ematical object and reveals hidden numerical relationships between various con- 
stants and functions. Perhaps the two outstanding examples being e + 1 = 0 and 
i! = 0.207879... 

The complex plane provides a simple way of visualising complex numbers, and 
illustrates their connection with vectors. 

Euler’s proof for e = соз Ө + isin@ opens the door for associating complex 
numbers with wave phenomena, which include acoustic waves, sea waves, electronics 
and quantum mechanics. 


16.5 Worked Examples 


16.5.1 Complex Addition 


Compute (3 + 2i) + (2 + 2i) + (5 — 3i). 


Solution 
Collect up like terms: 


(3 + 21) + (2+ 21) + (5 — 31) = 10 +i 


16.5 Worked Examples 


16.5.2 Complex Products 


Compute (3 + 21) 2 + 21) (5 — 31). 


Solution 
Expand algebraically and simplify: 


(3 + 2i)(2 + 21) (5 — 31) = (3 + 2i) (10 — 6i + 10i + 6) 
= (3 + 2i) (16 + 4i) 
= 48 + 12i + 32i — 8 
— 40 4- 44i 


16.5.3 Complex Division 
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| 
Compute ODED: 
Solution 
Expand the denominator, then multiply top and bottom by the denominator’s conju- 
gate: 
1 _ 1 
(2-931)4  23+2i 
2 1 (23 — 21) 
~ Q342) (23-21) 
_ 23—21 
| §29+4+4 
1 
= — (23 — 21 
533 ( ) 


16.5.4 Complex Rotation 


Rotate the complex point 3 + 21 by +90° апа +180°. 


Solution 
Multiply by +i and — 1. 
To rotate --90? (anticlockwise) multiply by i: 


(3+2) SHH 22-2343 
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To rotate —90° (clockwise) multiply by —i: 
—i(3 + 21) = —3i +2 = 2 — 31 
To rotate +180° (anticlockwise) multiply by —1: 
—1(3 + 2i) = —3 — 2i 
To rotate —180° (clockwise) multiply by —1: 


qoo = 


16.5.5 Polar Notation 


Given zı = 5 + 32 and 22 = == + i, compute their product using standard 
complex number format, and polar notation. 


Solution 
Standard complex number format. Expand algebraically: 


11 
NE 
--і 


Solution 
Polar notation. Compute the amplitude and argument for 21 and 22; multiply the 
amplitudes, and add the arguments: 
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zı = (1, 45°) 
z2 = (1, 135°) 
2122 = (1, 180°) = —1 


16.5.6 Real and Imaginary Parts 


Find the real and imaginary parts of 1/(1 + e!?°). 

Solution 

Multiply top and bottom by the conjugate, expand and isolate the real and imaginary 
parts: 


1 1 +е29 
1+е2 — (1+ е29)(1 + ei) 
1 + соѕ(20) — isin(20) 
72-722 со5(20) 
_ 1 + соѕ(20) . Sin(20) 


— 24200809) 24 2cos(20) 


R 1 1 
е - = — 
1+ e? 2 


1 1/ sin(26) 
Im | = 
(==) 2 (Eu) 


16.5.7 Magnitude of a Complex Number 


Find |... 


Te? |: 


Solution 
Use zz — |z|? and expand: 


2 1 


m (1 + е20)(1 + e-i29) 
1 
“"іреіее ү] 
1 
= 2-2 сов(20) 


1 
1+ ei20 
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= [2 + 2cos(20)]-!? 


| 1+ ei20 


16.5.8 Complex Norm 


Find the complex norm of z = 5 + 121. 


Solution 
Use |z| = Va? + 22: 
| = у 52 + 122 = 13 


16.5.9 Complex Inverse 


Find the inverse of 1 + i. 


Solution 
Multiply top and bottom by the conjugate and expand: 


(1—1) 1 


Laii = p 
Е иро 


16.5.10 de Moivre’s Theorem 


Express cos(50) in terms of cos 0, and sin(50) in terms of sin Ө. 


Solution 
Use (cos Ө + i sin 0)" = cos(n0) + 1sin(n0) and simplify: 
cos(50) + isin(50) = (соз Ө +isin@)> 
= cos? 0 + i5 cos^ Ө sin Ө + i?10 cos? 0 sin? Ө 
+ 1210 cos? 0 sin? Ө + i*5 cos Ө sinf 0 + i? sin? 0 
= cos? 0 — 10 cos? 0 sin? 0 + 5 cos 0 sint 0 
+ i(5cos* 0 sin@ — 10 cos? 0 sin? Ө + sin? Ө) 
cos(50) = Re[(cos Ө + isin0)?] 


= cos? 0 — 10 cos? 0 sin? 6 + 5 cos 0 біп” 0 
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2% 


y = -20cos38 у = 16соѕ50 


15 
y = 5cos8 


ly = cos(58 


ER 
Y 


0 т 27 т 


Fig. 16.19 со5(50) = 16cos? 0 — 20 cos? 0 + 5 cos 0 


but sin? Ө = 1 — cos? 8: 


cos? Ө — 10 cos? 0(1— cos? 0) + 5cos0(1 — cos? 0)? 

cos? 0 — 10соѕ? Ө + 10 cos 0 + 5с050(1 — 2cos? 0 + cost 0) 
11 соѕ? 0 — 10 cos? 0 + 5cos0 — 10 cos? Ө + 5 cos? 0 

cos(50) = 16 соз? 0 — 20 cos? 0 + 5 cos 0 

sin(50) = Im[(cos Ө + isin0)?] 


= 5cos^ 0 sin@ — 10cos 0 sin? 0 + sin? 6 


but cos? Ө = 1 — sin? 8: 
= 5sin0(1 — sin? 0)? — 10 sin? Ө(1 — sin? 0) + sin? 0 
= 5sinÓ — 10sin? 0 + 5sin? 0 — 10 sin? 0 + 10 sin? 0 + sin? 0 
sin(50) = 16ѕіп? 6 — 20sin? 6 + 5sin6 


Figure 16.19 shows the individual waveforms contributing towards cos(50), and Fig. 
16.20, the individual waveforms contributing towards sin(50). 
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Fig. 16.20 sin(5@) = 16 sin 0 — 20 sin? 6 + 5 sin@ 


16.5.11 nth Root of Unity 


Find the 4th and 6th roots of 1. 


Solution 
Use: 


z= e" к=0,1,2,...,п—1 


(=) 2. (=) 
= cos | — | +1s1n | —— 
n n 


Substituting different values for n and k: 
When n = 4: 


© 
wn 
| 
= 


Я 
Il 


II 
| 
5 


SES >| | 


Ne Ми Ми 
| 
| 
ы 


16.5 Worked Examples 


When и = 6: 
[k = 0] 
[К = 1] 
[К = 2] 
[К = 3] 
[К = 4] 
[К = 5] 


2 = cos (2) +isin(2) =! 
E 6 6) — 
2л ‚. (2л ШЕРГЕ 
пез (2) +isin() = iem 
4n\ 2. {4л ШЕРГЕ 
С (2) +isin (2) = 211% 
2 = cos ($) + isin) = -1 
Е 6 B 
ӛлу 2. (8x 1 3 
а = eos ($) + (=) = 2 17 
10r\ .. /10л ШЕГЕ 
а = cos (Ж) + isin ( 6 )=5 1 5 


16.5.12 Roots of a Complex Number 


Find ği. 


Solution 


Convert i to polar form and use: 


6--k2 6 +k2 
Iz= 4r ( T ET E =). O<k<n-1 
n 


n 


z=0+i=(, 0) 


r=1 


N 

© 

Ш 

© 

o 

о 
раро E 


ө + ө+ю 
TE Z) +-isin ( зд Z), oses2 


3 
=) ae Е =) 
+ +151 Е + 


3 3 
л J3 4 
)жыа(2)- +6 
+ +isin ы Т = MB a) 
3 ЖЕСЕ -2 
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16.5.13 Logarithm of a Complex Number 


Compute the natural logarithm of z = 5 — 121. 


Solution 
Convert z to polar form and use In 2 = In |z| + iarg(z): 


5— 12i = 4/52 + (—12)2ей ^ 12/5) 
ж ]3e-iL176 
In(5 — 121) ~ In (13g 189) 
А Ш 13 — 1.1761 
А 2.565 — 1.1761 


16.5.14 Raising a Number to а Complex Power 


Compute 31+. 


Solution 
Use z” = e" inc. 


25-3 
w=1+i 
z” = ewinz 
In 3 z 1.0986 


(1+1) 3 1.0986 + 1.09861 
ін (1.0986--1.0986і) 


же 
ду е1:0986,,1.09861 

А: 3(cos 1.0986 + isin 1.0986) 
А: 3(0.4548 + 0.89061) 

А 1.3644 + 2.67181 
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Chapter 17 A) 
The Riemann Hypothesis geu 


17.1 Introduction 


Having described prime and complex numbers, this chapter reviews the work of 
Leonhard Euler, and the brilliant German mathematician Bernhard Riemann (pro- 
nounced Reeman) (1826-1866), and his famous hypothesis. 


17.2 Background 


Prime numbers are very easy to define: a number whose factors are 1 and itself, 
the first nine primes being 2, 3, 5, 7, 11, 13, 17, 19, 23. However, in spite of this 
simple definition, it has been impossible to find a formula that predicts primes, or 
even count the exact number of primes up to some limit. Many mathematicians have 
taken up the challenge, but all have failed. Nevertheless, their endeavours have been 
astounding and created some incredible results, formulae and conjectures. 


17.3 Euler’s Work 


17.3.1 Euler's Zeta Function 


Euler was aware that: 
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therefore: 


sin x x x X 
= 1 + +... (17.1) 


Factorising (17.1) we obtain: 


ЕУ 
-( =)( C ss) m)" = 


The x? coefficient of (17.1) is р which must equal the x? coeffiicent of (17.2): 


- ba. bo, L n 1 " 
6 (л2 4r? 9л2 1672 


1/1 1 1 1 
—-ulp'z'yg'g;nt'" 


therefore: 


л 1 1 1 1 = i 
Ex m Uo де к SE (17.3) 


Equations (17.3) and (17.4) show Euler's zeta function, which was the starting point 
for Riemann. It is defined for any real number s greater than 1 by the infinite sum: 


с ык ы (17.4) 
5 45 55 ` 


Ж 
$ 3 


eS) = disks 


When 5 = 1 we get the harmonic series: 


1 1 1 1 1.1 
£U)eI4 dico чесен 
which diverges to infinity. 

Euler's extraordinary algebraic skills revealed how his zeta function is linked to 
the primes. For example, he found individual sums for the odd and even terms in 
(17.3) as follows: 

л? 1 1 1 1 


~=atatotgt: (17.5) 


multiply both sides by 5: 
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їл? 11 11 11 1l 

16 ZE ZZ 231 pet 
111 1 
= + et (17.6) 


subtract (17.6) from (17.5): 


=statatst (17.7) 


subtract (17.7) from (17.5): 


о E О 
24 2 2 6 82 


Employing ће same algebraic strategy with the zeta function, we һауе: 


l1 1l 1 1 1 
ТТТ ы ы ыш 

1 1111 1 

( ш xe ae ag ы ae og (17.8) 
1 1111 1 

(ғ) =z tr ИТТЕ (17.9) 


Euler then used (17.8) and (17.9) to develop his famous prime product formula by 
multiplying (17.8) by +: 


1 1 11 11 11 11 
Е 20) = + + + + 


2% 3518. P? 355% 3575 


11 11 1 
ЕЕ 44e (17.10) 


Subtracting (17.10) from (17.8) we have: 


Se Нас ee (17.11) 
35 о ж үр | 


By repeating this process for the remaining terms оп the RHS of (17.11), in the limit, 


we have: 
1 l 1 : 1 : 1 l =1 
"e-s)G-s)e-z)0-z)ye- 
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and 
1 
$(5) = 
MMH 
But: 
X 1 
X = =) 
n 5 
апа 
П 1 Е 1 
өз» (-96-96-96-2)- 
25 35 55 7: 
therefore: 


1 eu 
= ns 


р p n 


which is Euler's prime product formula, and was the starting point for Riemann's 
seminal 1859 paper Ueber die Anzahl der Primzahlen unter einer gegebenen Grósse, 
which translated is About the Number of Prime Numbers Under a Given Number 
(Riemann 1859). 


17.4 The Prime Number Theorem 


An approximate formula for counting primes up to a value x, is given by: 


x 
л(х) ~ — 
Inx 


Table 17.1 shows the percentage error for increasing values of x. 
Riemann’s paper introduced an improved prime number theorem: 


a(x) ~ Li(x) 


where: 


| * 1 
Li(x) =| no 
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Table 17.1 a(x) and x/ [nx for different values of x 


d л(х) x/Inx % error 
102 25 22 -12% 

103 168 145 —13.7% 
106 78, 498 72, 382 —7.8% 
10? | 50,847,534 | 48,254,942 | —5.1% 


Table 17.2 z (x) and (х) — z (x) for different values of x 


5% л(х) Li(x) — л(х) % error 
108 5,761,455 754 0.013% 
109 50,847,534 1,701 0.0033% 
1019 455,052,511 3,104 0.00068% 
1014 | 53,204,941,750,802 314,890 0.0000006% 


Table 17.2 shows the accuracy of the prime number theorem at counting primes. 
Thus zr (x) lies between an upper value of Li(x), and a lower value of x/ In x. 


17.5 The Riemann Zeta Function 


Euler’s zeta function 6 (s) assumes that s > 1, where s € R, and is convergent. The 
breakthrough made by Riemann was to make s complex: s е C. Riemann was partic- 
ularly interested in when ¢(s) = 0, and showed that Euler’s prime product formula 
converges for complex s, with Re(s) > 1, and therefore has no zeros in the region 
marked yellow in Fig. 17.1. 

The infinite sum form of the zeta function is not very useful when one is looking 
for particular values of s that make ¢(s) = 0, which is why Riemann proposed а 
functional form (17.12) in his paper: 


TS 


2 


0) = 2a sin ( )ra 55 — s) (17.12) 


where the gamma function Г(л) = (n — 1)! 
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Fig. 17.1 Тһе complex Im 
plane for the zeta function 


= ее! 
| non-trivial zeros 
| | 


| 
| SE | 
| 


| trivial Zeros | 


Equation (17.12) applies to the entire complex plane, and one can see in the green 
zone of Fig. 17.1 values of s that make ¢(s) = 0; these are called ‘trivial zeros’, 
and arise when sin(z: 5/2) = 0 in (17.12). i.e. when s = —2n. Thus there exists an 
infinite number of points along the negative real axis, where s makes ¢(s) = 0. 

Riemann also showed that other values of s exist on a critical line Re(s) — 1/2, 
where ¢(s) = 0; these are called ‘non-trivial zeros’. Furthermore, these occur in 
conjugate pairs. i.e. 1/2 + bi. In the critical red strip of Fig. 17.1, the first such non- 
trivial zero is 1/2 + 14.134725 ...i. Others include 1/2 + 21.022040...1, 1/2 + 
25.010856...1, 1/2 + 30.425 and 1/2 + 32.935 ...1. 

The other point marked on Fig. 17.1 is 5 = 1, which makes (1) = oo. 


17.5.1 The Riemann Hypothesis 


Having gone through the above explanation, the Riemann hypothesis is extremely 
simple to state, and is the conjecture that ‘the zeta function is zero only at the negative 
even integers (trivial zeros), and complex numbers s with Re(s) — 1/2, (non-trivial 
zeros). 

Some mathematicians regard the Riemann conjecture as a major unsolved prob- 
lem, and that its proof will influence other branches of mathematics, especially the 
distribution of prime numbers. Others believe that it is false. So far, no one has been 
able to prove or disprove the conjecture. Hopefully, the prize of prize of $106, offered 
by the Clay Mathematics Institute, will keep mathematicians trying. 
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Chapter 18 A) 
Matrices б 


18.11 Introduction 


This chapter introduces the reader to matrices via geometric transforms, and covers: 
notation, vectors, null, unit, trace, determinant, transpose, symmetric, antisymmet- 
ric matrices, arithmetic operations on matrices, inverse, orthogonal matrices, and 
concludes with some worked examples. 

Matrices, like determinants, have their background in algebra and offer another 
way to represent and manipulate equations. Matrices can be added, subtracted and 
multiplied together, and even inverted, however, they must give the same result 
obtained through traditional algebraic techniques. 


18.2 Geometric Transforms 


Let P(x, y) be a vertex on a 2D shape, then we can devise a geometric transform 
where P(x, y) becomes P'(x', у’) on a second shape. For example, when the fol- 
lowing transform is applied to every point on a shape, it is halved in size, relative to 
the origin: 


x’ = 0.5x 
у’ = 0.5у 


and this transform translates a shape horizontally by 4 units: 


x =x+4 
yoy 
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Fig. 18.2 A translate transform followed by a scale transform 


Figure 18.1 illustrates two successive transforms applied to the large green star 
centred at the origin. The first transform scales the star by a factor of 0.5 creating the 
smaller yellow star, which in turn is subjected to a horizontal translation of 4 units, 
creating the blue star. 

Figure 18.2 starts with the same green star, but this time it is translated before 
being scaled. The final blue star ends up in a different position to the one shown in 
Fig. 18.1, which demonstrates the importance of transform order. 

The algebra supporting the transforms in Fig. 18.1 comprises: 


x’ = 0.5х 
у’ =0.5y 
x" — xl +4 


у” Z y! 
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which simplifies to: 


x" = 0.5х +4 
у” = 0.5у 


Whereas: the algebra supporting the transforms in Fig. 18.2 comprises: 


x =x+4 
У = у 
x" = 0.5х' 
у” = 0.5y' 
which simplifies to: 
x" = 0.5(х + 4) 
у” = 0.5у 


and reveals the difference between the two transform sequences. 


18.3 Transforms and Matrices 


Matrix notation was researched by the British mathematician Arthur Cayley around 
1858. Cayley formalised matrix algebra, along with the American mathematicians 
Charles Peirce (1839-1914), and his father, Benjamin Peirce. Previously, Carl Gauss 
had shown that transforms were not generally, commutative, i.e. Tj T? 4 ТТІ, 
(where T; and T; are transforms) and matrix notation clarified such observations. 

Consider the transform Ту, where x and y are transformed into x’ and у’ respec- 
tively: 


ca 
mal =e (18.1) 


у =cx+dy 


and a second transform То, where x’ and у’ are transformed into x" and y" respec- 
tively: 


pat eee (18.2) 


у” = Сх + Dy’ 
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substituting (18.1) in (18.2) we get: 

x" = A(ax + by) + B(cx + dy) 

у” = C(ax + by) + D(cx + dy) 


which simplifies to: 


x" = (Aa + Вс)х + (Ab + Bd)y 
у” = (Ca + Do)x + (Cb + Dd)y 


T; = (18.3) 


Having derived the algebra for T3, let’s examine matrix notation, where constants 
are separated from the variables. For example, the transform (18.4): 


e = Oe (18.4) 
y = сх + ау 
сап be written in matrix form as: 
A E. á (18.5) 
M c d y 


where (18.5) contains two different structures: two single-column matrices or column 
vectors: 


and 


and a 2 x 2 matrix: 


Algebraically, (18.4) and (18.5) are identical, which dictates the way (18.5) is con- 
verted to (18.4). therefore: using (18.5) we have x’ followed by the ‘=’ sign, and the 
sum of the products of the top row of constants a and b with the x and y in the last 
column vector: 


x'— ах + by 


Next, we have y’ followed by the ‘=’ sign, and the sum of the products of the bottom 
row of constants c and d with the x and y in the last column vector: 


y = сх + ау 


18.3 Transforms and Matrices 319 


As an example: 


x obl 4 x 
у 5 6 y 
is equivalent to: 
x’ = 3x +4y 
у’ = 5x + бу 


We can now write T; and T» using matrix notation: 


шаа le 7. я (18.6) 
у с у 

uus] 9 pea ы (18.7) 
y” C D у! 


and substituting (18.6) in (18.7) we have: 


Т; = x = A В а b x (18.8) 


у” C D c d y 


But we have already computed Тз (18.3), which in matrix form is: 


Д Аа + Вс АБ + Bd Ж 


quel te (18.9) 
у” Ca+ Dc Cb4 Dd y 
which implies that: 
A B a b | | Aa+Bc АБ+ Bd 
C D god Са+ Dc Cb+ ра 


and demonstrates how matrices must be multiplied. Here are the rules for matrix 
multiplication: 


A В Goce _ Aa+ Вс 
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1: The top left-hand corner element Aa + Bc is the product of the top row of the 
first matrix by the left column of the second matrix. 


А В ... b + Ab 4 Ва 


2: The top right-hand element Ab + Bd is the product of the top row of the first 
matrix by the right column of the second matrix. 


C D с... Ca+ Әс 


3: The bottom left-hand element Ca + Dc is the product of the bottom row of the 
first matrix by the left column of the second matrix. 


b 
с D e d e Cb Dd 


4: The bottom right-hand element Cb + Dd is the product of the bottom row of the 
first matrix by the right column of the second matrix. 
Let's multiply the following matrices together: 


4 3 5 | | @x3+4x7) (2х5+4х9 | | 34 46 
8 7 9 (6х3+8х 7) (6х5--8х9) 74 102 


18.4 Matrix Notation 


Having examined the background to matrices, we can now formalise matrix nota- 
tion. 

A matrix is an array of numbers (real, imaginary, complex, etc.) organised in m 
rows and n columns, where each entry a;; belongs to the i-th row and jth column: 


а! ар аз ‘`` (ің 
ад 022 аз с ап 
А- аз аз 033 `` аз 


Ami Am2 Am3 Сс” Amn 
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It is also convenient to express the above definition as: 


A= [ат п 


18.4.1 Matrix Dimension or Order 


The dimension or order of a matrix is the expression т x n where т is the number 
of rows, and n is the number of columns. 


18.4.2 Square Matrix 


A square matrix has the same number of rows as columns: 


а 42... Gin 1 » 

a» a») An 
A= [aij]n n = ! ‚ 6.5 6 5 7 
4 3 | 

а, а,2 А" Ann 


18.4.3 Column Vector 


A column vector is a matrix with a single column: 


2 

a21 
, eg 3 
23 

Am1 


18.4.4 Row Vector 


A row vector is a matrix with a single row: 


Е? ü cc anto e.g. E 3 5| 
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18.4.5 Null Matrix 


A null matrix has all its elements equal to zero: 


0 00 
0, = [а п = E . ; ӨБ; Өз = ооо 
058 25.24 


The null matrix behaves like zero when used with numbers, where we have, 0 + п = 
n + 0 =л and 0 x n =n x 0 = 0, and similarly, 0 + A = A +0 = А and бА = 
A0 — 0. For example: 


0 0 0 1 2 3 ] x 3 ооо ооо 
0 0 0 45 6|=|4 5 6 000-000 
0 0 0 7 8 9 7 8 9 ооо ооо 


18.46 Unit Matrix 


A unit matrix Т,, is a square matrix with the elements on its diagonal aj, to ал 
equal to 1: 


1 0 

0 1 0 100 
І, = [ав = . ко, . , eg. I, = ото 

i a 001 


The unit matrix behaves like the number | in a conventional product, where we have, 
1xn=nx 1 = п, and similarly, ТА = AI = A. For example: 
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18.4.7 Trace 


The trace of a square matrix is the sum of the elements on its diagonal aj, to али: 


Tr(A) = а 
i=l 


For example: 


1 2 
A=] 4 5 , then TA) = 1+5 +9 = 15 
7 8 


о с WwW 


The trace of a rotation matrix сап be used to compute the angle of rotation. For 
example, the matrix to rotate a point about the origin is: 


A= cos@ —sinO 


sinô сов0 


where: 
Tr(A) = 2сов0 


which means that: 


Ө = arccos CS 


The three matrices for rotating points about the x-, y- and z-axis are respectively: 


1 0 0 
Rox = | 0 cosa —sina 
О sing cosa 
cosa 0 sina 
Ray = 0 1 0 
—sina О cosa 
cosa —sina 0 


Raz: =| sina cosa 0 


0 0 1 
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and it is clear that: 

Tr(R, x) = ТК, у) = Tr(Rg,z) = 1+ 2 cos œ 
therefore: 


Tr(Ra.x) - | 


а = arccos 
2 


18.4.8 Determinant of a Matrix 


The determinant of a matrix is a scalar value computed from the elements of 
the matrix. The different methods for computing the determinant are described in 
Chap. 13. For example, using Sarrus’s rule: 


2 3 
5 6 | then: det A = 45 + 84 +96 — 105 – 48—72 =0 
8 9 


18.4.9 Transpose 


The transpose of a matrix exchanges all row elements for column elements. The 
transposition is indicated by the letter “T’ outside the right-hand bracket: 


T 
аш 412 413 ап 921 азі 
а21 аә 93 — | ао ао» аз 
аз аз азз аз 4023 азз 

For example: 

T 
124 164 
6 5 7 =| 2 8 3 
4 3 1 4. F 1 
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апа 


To prove that (АВ)! = ВТАТ, we could develop a general proof using n x n matrices, 
but for simplicity, let's employ 3 x 3 matrices and assume the result generalises to 
higher dimensions. Given: 


aj 412 413 ап 421 аз] 
— T 
A= а 422 A23 ý А = аш An аз 
аз 032 433 413° аз 433 
and 
бі бо ba бі by bai 
52 т 
В-| bi b» bs |> B = | b2 b» ӛз 
ӛзі b32 b33 біз Роз b33 
then: 
аууб + а12621 + а1з6з1 ару + а12622 + а1з632 aii bia + a12b23 + a13b33 
АВ = | ари amb + азззр anbi + азоб +azbzz bız +anbz + аззбзз 
a3ibi1 + a32b21 + a33b31 азірі2 + а32622 + a33b32 азіріз + a32b23 + a33b33 
aby + аро + a13b31 арі + а22Ь21 + a23b31 азір + a32b21 + a33b31 
ү э 
(AB) = | aubetanbytanby anbi +tanby+anby азі + a32 22 + a33b32 
411513 + а1262з + a13b33 арз + a22b23 + a23b33 азіріз + a32b23 + аззЬзз 
and 
биап + b21a12 + b31a13 Бао] + Ь21а22 + b31a23 Базі + b21a32 + b31433 
ВТАТ = 


Буза + 622а12 + b32a13 bi2a21 + 622422 + b32a23 b1231 + Боэазо + b32033 


Бузау + Роза + Вззаз Риза + 62за22 + b33a23 заза + Бзазо + Ьззазз 


which confirms that (АВ)Т = ВТАТ. 
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18.4.10 Symmetric Matrix 


A symmetric matrix is a square matrix that equals its transpose: i.e. А = AT. For 
example, A is a symmetric matrix: 


т 
12 4 12 4 
А= |2 5 з /=] 2 5 3 
4 3 6 4 3 6 


In general, a square matrix A = S + Q, where S is a symmetric matrix, and Q is an 
antisymmetric matrix. The symmetric matrix is computed as follows. Given a matrix 
А and its transpose АТ: 


а 42... Gin а аа .. ап 
921 аә ... An а an an2 
A= > АТ- 
ат аһ2 e ап аһ An e gn 
their sum is: 
2ап ао-Ғад ... аһ +ап 
ар + а21 2422 e an + ау? 
А+А = 
Gin F а, ап T an2 t 2а,п 


By inspection, А + АТ is symmetric, and if we divide throughout by 2 we have: 


S= 5 (А-А) 


which is defined as the symmetric part of A. For example: 


а 412 413 ап 421 Q3 
T 
A = 912 An аз 


аз 432 азз аз A23 азз 
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then: 
go (A + AT) 
2 
ап (а12 + a21)/2 
= | (ар + 421)/2 ао 
(аз + а31)/2 (a23 + аз2)/2 
ай 53/2 52/2 
= | 53/2 a» 51/2 
52/2 51/2 азз 
where: 


51 = az + a32 
52 = а13 + 431 


53 = d12 + аз 


Using a real example: 


0 1 
A=| 3 | ‚ А = 
426 
024 
5=| 213 
436 


which equals its own transpose. 


18.4.11  Antisymmetric Matrix 


An antisymmetric matrix is a matrix whose transpose is its own negative: 


АТ = —A 


and is also known аз а skew-symmetric matrix. 


но 


(аз + a31)/2 


a23 + a32 


KR =e uU 


433 


ON ы 
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As the elements of A and АТ are related by: 


Grow,col = —4col,row 


When k = row = col: 
акк = —k,k 


which implies that the diagonal elements must Бе zero. For example, this is an 
antisymmetric matrix: 


T 
—2 4 0 -2 4 
А-| 2 0 -3 |=-| 2 0 -3 
-4 3 0 -4 3 0 
The antisymmetric part is computed as follows. Given a matrix А and its transpose 
АТ: 
ап ар ... аш а аа .. Ani 
А а21 480222... (ои АТ 2 ао Aan ... Um 
ат аһ2 e ап аһ An e (тп 


their difference is: 


0 d12 — аз ... An — ап 


A—AT= —(a12 — ал) 0 ... Aon — An? 


(ain anı) (azn am) фа 0 


It is clear that А — АТ is antisymmetric, and if we divide throughout Бу 2 we have: 
О=5(А-АТ) 
2 
For example: 


ай ау аз а аз аз] 


> 
4 
Il 


921 422 473 E 912 4222 аз 


аз 032 433 913 423 433 
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0 (an — а21)/2 (аз- аз1)/2 
О = | (an –ар)/2 0 (аҙ - a32)/2 
(азі — аз)/2 (a2 — азз)/2 0 


and if we maintain some symmetry with the subscripts, we һауе: 


0 (аэ-ал)/2 -(аз- аз) /2 
О= | (а —a1)/2 0 (az — азә)/2 
(азі — а13)/2 -(азз- аз)/2 0 
0 43/2 -ш/2 
=| —4/2 0 41/2 
Ф/2 -а/2 0 


where: 


qı = 423 — 432 
42 = 431 — 413 


qa = 412 — 421 


Using a real example: 


014 03 4 

А=| 314 |, А=| 112 
4 2 6 4 4 6 
0 -1 0 

Q= 0-1 
0 -1 0 


Furthermore, we have already computed: 
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and 


5+0 = 


ы о c 
ҺО к. — 


18.5 Matrix Addition and Subtraction 


As equations can be added and subtracted together, it follows that matrices can also 
be added and subtracted, as long as they have the same dimension. For example: 


и 22 2 1 
A=] 14 -15 |, and В=| 4 5 
27 28 1 8 
then: 
13 23 9 2l 
А+В= | 10 -10 |, А-В=| 18 —20 
28 36 26 20 


18.6 Scalar Multiplication 


As equations can be scaled and factorised, it follows that matrixes can also be scaled 
and factorised: 


11 аш “++ din лап Лар meas Хаіз 
а21 422 ... AY Аа Ха» “+. Àd23 
AA = А 4 = 
Ami Am2 ... Amn Хат Аат? ... Хатп 
For example: 
2 3 2 4 6 
2 = 
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18.7 Matrix Products 


We have already seen that matrices can be multiplied together employing rules that 
maintain the algebraic integrity of the equations they represent. And as matrices may 
be vectors, rectangular or square, we need to examine the matrix products that are 
permitted. To keep the notation simple, the definitions and examples are restricted 
to a dimension of 3 or 3 x 3. We begin with row and column vectors. 


18.7.1 Row and Column Vectors 


Given: 
a 
А = [а b с], and B=] в 
y 
then: 
a 
AB=| a b с | в | =a +08 + су 
А 


which is a scalar and equivalent to the dot ог scalar product of two vectors. 
For example: 


10 
А=| 2 3 4 |, and В=| 30 
20 
then: 
10 
AB=[ 2 3 4 | 30 | =20 +90 + 80 = 190 
20 
Whereas: 
bu биап bian biai 
ВА = | by E (12 l= блап bnan bza 


ӛзі Ват Әзіз Буа 
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For example: 


10 20 30 40 
BA= | 30 [2 3 а |= 60 90 120 
20 40 60 80 


The products AA and ВВ аге not permitted. 


18.7.2 Row Vector and a Matrix 


Given: 
бі бо bg 
A=| ay di? in]: and B— | bj by 5 
by bm2 b33 
then: 
бі 612 bis 
АВ = [ ай a12 дё | іі bn b» 


Әлі bm2 b33 


= ШІЛ + a12b51 + 413531) (a11b12 + а12622 + 413532) (a11b13 + а12623 + a13b33)] 


The product BA is not permitted. 
For example: 


А=[2 3 4], and B= 


о мн 
л & N 
С л ч 


then: 
1 2 3 
АВ = [2 3 4] 3 4 5 
4 5 6 
4+1 


=[0+9+16) (4+12+20) (6+15+24] 
= [27 36 45] 


18.7 Matrix Products 333 


18.7.3 Matrix and a Column Vector 


Given: 
ап an аз bi 
А = | а an аз |, and B=] by, 
азі аза 433 ӛзі 
then: 
а ар аз by a11b11 + ay2b21 + a13b31 
АВ = | a a» аз bj | = | axb c anb + 473b31 
азі @32 433 ӛзі d31b11 + a32b21 + аззЁзі 


The product BA is not permitted. 
For example: 


ik 4 2 
A=] 3 4 5 |, and B=] з 
4 5 6 4 
then: 
i 23 2 2+6+12 20 
АВ = | 3 4 5 3 |=| 6+12+20 |=| 38 
4 5 6 4 84-15 4-24 47 


18.7.4 Square Matrices 


To clarify the products, lower-case Greek symbols are used with lower-case letters. 
Here are their names: 


a = alpha, B = beta, y = gamma 
А = lambda, и = mu, v = пй 
р = rho, o = sigma, т = tau 
Given: 
abe a y 
A= р а r |, and B=] À y v 
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then: 
a b g 
АВ- р а f 
и v ш 
апа 
a Ву 
ВА = À ш 
p © 7 


For example: 


then: 


АВ = 


and 


BA = 


к- 


ON A N 


ON A 


w 


ON 


л ә кє 


18.7.5 Rectangular Matrices 
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аа + bX-ccp aB+bu+co ay+bv+ct 
patgr+rp pB+qutro py+qvt+rt 
ua d vAd wp иВ +ъи + шо иу + о0о + шт 
аа + Вр+уи ab+Bqtyv ас+ Вг +уш 
ла-+ ир-+ ъи Ab+uq+evv А+ ш + vw 
patop+tu pb+oq+tv рс+ог + тш 
2 3 4 
and В-| 4 5 6 
6 7 8 
3 28 34 40 
5 =| 52 64 76 
7 8 76 92 112 
3 31 40 49 
5 |=] 49 64 89 
7 67 88 109 


Given two rectangular matrices A and B, where A has a dimension m x n, the 
product AB is permitted iff B has a dimension n x p. The resulting matrix has a 
dimension m x p. For example: 
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then: 


| аа | b b b b 
и Бо Әз ba 

АВ = | as an 
бл bn ӛз Род 


3] 32 


(аб + а12621) (атро + а12622) (a11b13 + а12623) (ард + а12624) 


(аз151 + 422621) — (a21b12 + а22622) (аруз + а22623) (a21b14 + 422624) 


(a31b11 + а32621) (аз + аз2622) (азібіз + a32b23) (аз1Ь14 + 43224) 


18.8 Inverse Matrix 


As matrix division is not possible, when two matrices have to be divided, we multiply 
by the inverse matrix. A square matrix А,, that is invertible satisfies the condition: 


—1 —1 
АА = А, Ann = I, 


nn 


where A. is unique, and is the inverse matrix of А,,. For example: 


A= 4 
5.4 
then: 
А-1 2 4 —3 
—5 4 
because: 
AAW! = 4 3 4 —3 _ 10 
5 4 —5 4 0 1 


A square matrix whose determinant is 0, cannot have an inverse, and is known as a 
singular matrix. 
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We now require a way to compute А”!, which is rather easy. Consider two linear 
equations: 


я 
в 
> 
= 


= (18.10) 
y с y 
Let the inverse of: 
а b 
с а 
Бе: 
e f 
g h 
therefore: 
E " "m (18.11) 
g h c d 0 1 
From (18.11) we have: 
ае+ сў = 1 (18.12) 
be+df = 0 (18.13) 
ag + ch = 0 (18.14) 
bg + ай = 1 (18.15) 


Multiply (18.12) Бу d and (18.13) by с, and subtract: 


ade + cdf = d 
bce + саў = 0 
ade — bce = d 
therefore: 
d 
e — 
ad — bc 


Multiply (18.12) by b and (18.13) by a, and subtract: 


abe + bcf = Б 
abe + аа} = 0 
adf —bcf = —b 
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therefore: 
—b 


Сы 


Multiply (18.14) Бу d and (18.15) by c, and subtract: 


adg+cdh=0 
bcg + саһ = c 
ай - bcg = —c 
therefore: 
—c 


ad — bc 


8 = 


Multiply (18.14) by b and (18.15) by a, and subtract: 


abg + bch = 0 
abg + adh = a 
adh — bch = a 
therefore: р 
~ ad — be 


We now have values for e, f, g and Л, which are the elements of the inverse matrix. 
Consequently: 


A= a o ‚ and А! = e f 
c d g h 
then: 
Act 1 а —b 


det A ae ü 


The inverse matrix permits us to solve a pair or linear equations as follows. Starting 
with 


multiply both sides by the inverse matrix: 


X 


= АТА 
у у 


А! 


338 18 Matrices 


A! х! = 1 0 x | [x 
у 0 1 y y 
X | А х! 
y у 
х |_ 1 а -ь х 
^ det A =F y 


Although the elements of A^! come from A, the relationship is not obvious. However, 
if A is transposed, a pattern is revealed. Given: 


А 2 ‚ апа А = а 
са b d 
and placing А”! alongside АТ, we have: 
АЕ Ў and A’ = 3 
g h b d 


The elements of А”! share a common denominator (det A), which is placed outside 
the matrix, therefore: the matrix elements are taken from АТ as follows. For any 
entry а in A~', mask out the i-th row and j-th column іп АТ, and the remaining 
entry is copied to the ij-th entry in A~!. In the case of e, it is d. For f, itis b, witha 
sign reversal. For g, it is c, with a sign reversal, and for h, it is a. The sign change is 
computed by the same formula used with determinants: 


(-1)% 


which generates this pattern: 

+ = 

=: ӘР 
You тау be wondering what happens when a3 x 3 matrix is inverted. Well, the same 
technique is used, but when the i-th row and j-th column in АТ are masked out, it 


leaves behind а 2 x 2 determinant, whose value is copied to the ij-th entry in A~', 
with the appropriate sign change. We investigate this later on. 
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Let's illustrate this with an example. Given: 


42 = 6x + 2y 
28 = 2х + Зу 
let: 
A= 2 
3 
then det А = 14, therefore: 
x | 1 22-2 42 
y ЕН 2 6 28 
_1| 7 
14 | g4 
_ |5 
6 


which is the solution. 


Now let’s investigate how to invert a 3 x 3 matrix. Given three simultaneous 
equations in three unknowns: 


x’ = ах by + cz 
y = ах + еу + fz 
z = вх + һу + jz 


they сап be written using matrices as follows: 


a с X X 
жм ажа og Emp oy 
2 gh j 2 2 
Гес 
Г m n 
АС =| ра! 
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therefore: 
l m n a b c 1 0 0 
р 4 т d e f |-|010 (18.16) 
s t u g h j 0 0 1 
From (18.16) we can write: 
la+md+ng=1 (18.17) 
lb+me+nh=0 (18.18) 
lc+mf nj = 0 (18.19) 
Multiply (18.17) by e and (18.18) by d, and subtract: 
ael + dem + egn = e 
bdl + dem + dhn = 0 
ael — bdl + egn — dhn = e 
l(ae — bd) + n(eg — dh) = e (18.20) 
Multiply (18.18) by f and (18.19) by e, and subtract: 
bfltefm+ fhn=0 
се + еўт + ејп = 0 
bfl—cel+ fhn—ejn=0 
l(bf — се) - n(fh — ej) = 0 (18.21) 


Multiply (18.20) by (fh — ej) and (18.21) by (eg — dh), and subtract: 


Кае — bd)( fh — еј) + n(eg — dh)( fh — ej) = e( fh — ej) 
l(bf — ce)(eg — dh) + n(eg — dh)(fh — ej) =0 
Кае — bd)( fh — ej) — l(bf — ce)(eg — dh) =efh— ej 
l(aefh — ae? j — bdfh + bdej — befg + bdfh + ce?g — сей) = efh — e? j 
l(aefh — ae? j + bdej — befg + ceg — cdeh) = efh — ej 
l(afh + Ба] + ceg — ае] — сай — bfg) = fh — еј 
l(aej +bfg+cdh — afh — Бај — сев) = ej — fh 


but (aej + bfg + сай — af h — Бај — сев) is the Sarrus expansion for det A, there- 


fore: 
_ in fh 
~ det A 
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An exhaustive algebraic analysis reveals: 


_ ej — fh КЕ дес; _ bf —ce 
© detA ' Ter: АҒАС? " — det A 
dj — gf aj — gc af —dc 
р=- , а = E y -—— 
det A det A det A 
_ dh — ge ды: аһ- gb | ae — bd 
^ detA ' ~ detA ' “= det А 
where: 
lm n a b c 
A= p qr |» А=| de f 
s t u g h j 


However, there does not appear to be an obvious way of deriving A^! from A. But, 
as we discovered with the 2 x 2 matrix, the transpose АТ resolves the problem: 


l mn a d g 
А”! = ра r jb А" = b eh 
sS t u € 3g 


The elements for A^! share a common denominator (det A), which is placed outside 
the matrix, therefore: the matrix elements are taken from AT as follows. For any entry 
aij in AT ! mask out the i-th row and j-th column in АТ, and the remaining elements, 
in the form of a2 x 2 determinant, is copied to the i j-th entry in A^. In the case of /, 
itis (ej — hf). For m, itis (bj — hc), with a sign reversal, and for n, itis (bf — ес). 
The sign change is computed by the same formula used with determinants: 


(— + J 
which generates the pattern: 
ее Б 
D + E 
adm а 


With the above aide-mémoire, it is easy to write down the inverse matrix: 


(ej - fh) 0) – ст) (bf ce) 
зад -@-&) (ај - 80) -а/-ао 
(dh — ge)  —(ah— gb) (ае- bd) 
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This technique is known as the Laplacian expansion or the cofactor expansion, 
after Pierre-Simon Laplace. The matrix of minor determinants is called the cofactor 
matrix of A, which permits the inverse matrix to be written as: 


12 (cofactor matrix of А)! 


A- 
det A 


Let's illustrate this solution with an example. Given: 


18 = 2x + 2y + 22 
20 = x +2у + 3z 


7-у--2 
therefore: 
18 25:22 22; x 
20° [= 1 2 3 y 
7 0 1 1 2 
x 
= А y 
2 
апа 
а«А-4--2-2-6---2 
210 
АТ-| 2 2 
2 3 1 
therefore: 
—1 0 2 
е 
2775 —1 —4 
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and 
x i —-] 0 2 18 
y == -l 2 -4 20 
2 1 -2 2 7 
2. 
4 


and the solutionis x = 2, у= 3, z=4. 


18.8.1 Inverting a Pair of Matrices 


Having seen how to invert a single matrix, let’s investigate how to invert a pair of 
matrices. 

Given two matrices T and В, the product TR and its inverse (TR)~! must equal 
the identity matrix I: 


(ТЕ)(ТЕ) ! =I 
and multiplying throughout by T^! we have: 


ТИВ "ЕТ! 
R(TR);!- T^ 


Multiplying throughout by R^! we have: 


В ВВ) ЕВ! 
(TR)! = RIT! 


therefore: if T and R are invertible, then: 
(TR)! = RIT! 
Generalising this result to a triple product such as STR we can reason that: 


(STR)! = RIT 1$! 
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18.9 Orthogonal Matrix 


A matrix is orthogonal if its transpose is also its inverse, i.e. matrix A is orthogonal 
if: 


For example: 


JE edt 
А-| 7 v2 
Б e 
02 V2 
and 
pl eb 
АТ = м2 м2 
zd. du 
42. 42 
and 
SEO SE WO bl 
ААТ — v2 V2 42 № | _ 1 0 
1 1 1 1 0 1 
0 V2 42. 42 
which implies that АТ = А-!. 
The following matrix is also orthogonal: 
a=] œs В —sing 
sinp соѕ В 
because: 
АТ — соѕ8 sing 
—sinf соѕ 8 
апа 
ААТ — со$8  —sin f co sinf | |1 0 
sinf соѕ В — sinf соѕ В 0 


Orthogonal matrices play an important role in rotations because they leave the origin 
fixed and preserve all angles and distances. Consequently, an object’s geometric 
integrity is maintained after a rotation, which is why an orthogonal transform is 
known as a rigid motion transform. 
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18.10 Diagonal Matrix 


A diagonal matrix is a square matrix whose elements are zero, apart from its 
diagonal: 


11 0 0 
А = 0 к 0 
0 0... аһ 


The determinant of a diagonal matrix must be: 
det A = ay, ха» X -.: X Ann 
Here is a diagonal matrix with its determinant: 
200 
А=| 030 
004 
det A = 2 x 3 x 4 = 24 


The identity matrix I is a diagonal matrix with a determinant of 1. 


18.10.1 Augmented Matrix 


An augmented matrix arises from a system of linear equations, where the value to 
the equations is incorporated with the matrix. For example, the set of linear equations 
(18.22) is represented as matrices in (18.23), where x = 1, y = 2, z = 3: 


2x + 2у- 22 = 12 
—3x—y+2z=1 (18.22) 
x—2y—2z=-11 


"MEE EE 1 (18.23) 
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However, the right-most column vector, can be incorporated with the middle matrix 
to make Gaussian elimination easier (18.24): 


х 2 2 12 
ж [ee == ® |1 (18.24) 
2 1 2 2 11 


Equation (18.24) is the augmented matrix, and is solved in Gaussian elimination. 
For example, starting with (18.24), a next step is to place a 1 in column 1: 


Row, + Row; = Row, 


x 1 0 1 
y |=] -3 -1 2] 1 
z i ex = | =li 


where the right-most column is part of the matrix update. The augmented matrix 
makes this possible. See Gaussian Elimination for a complete process. 


18.10.2 Gaussian Elimination 


Carl Gauss developed a way of simplifying and solving a system of linear equations 
by back-substitution, now called Gaussian elimination. The technique involves sim- 
plifying the augmented matrix representing the equations, by adding and subtracting 
whole or fractional rows, to or from other rows. The technique terminates by creating 
a RREF (the next entry) in the left-hand side of the augmented matrix, and the answer 
is in the right-hand side. 


2x +2y + 2z = 12 
—3x-—y+2z=1 
x—2y—2z=-11 


x 2 2 12 
y |=| -3 -1 2] 1 
2 |. 42 —2|—11 


Place a 1 in column 1 оп a background of Os in the first row. 


18.10 Diagonal Matrix 347 


Row, + Row; = Row; 


x 1 0 0 1 
y |=] -3 -1 2 1 
2 1 2 2 11 


Place а 0 in column 3 of the second row. 


Row» + Rows => Row? 


X 1 0 0 1 
y |=] —4 -3 01-10 
2 1 2 2 11 


Place а 0 in column 1 and a -3 in column 2 of the second row. 


4Row, + Row, = Row; 


x 1 0 0 1 
у |= 0 —3 —6 
© 1 2 2 11 


Place a 1 in column 2 of the second row by dividing the second row by -3. 


—1/3Row2 — Row; 


x 1 0 1 
у |= 0 1 2 
5 1-2 -2(|-1 


Place a 0 in column 1 in the third row. 


Row, + Rows = Rows; 


x 1.0 0 1 
y |=] 0 1 0 2 
2 0-2 -21-10 


Place а 0 in column 2 in the third row. 
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2Row2 + Rows = Row; 


x 10 1 
y |=] 0 1 0 2 
2 0 0 -2|-6 


Place a 1 in column 3 of the third row by dividing the third row by –2. 


—0.5Row3 = Rows; 


x 1001 
y |=] 0 1 2 
2 0013 
х= 1 
у= 2 


18.10.3 Reduced Row Echelon Form 
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The Reduced Row-Echelon Form is a matrix form that exists after Gaussian elimi- 
nation, and also goes under the initials RREF. The reduced row echelon form arises 
when simplifying systems of linear equations. For example, we know that the fol- 


lowing equation: 
2х +4y — 6z = 20 


can be written: 
x+2y—3z= 10 


without disturbing the equation. Therefore, the RREF strategy derives from such 


techniques when working with matrices. 


A matrix is in a reduced row echelon form if it satisfies the following conditions: 


e The leading in each nonzero row is | (called a leading one). 


e Each column containing a leading one has zeros in all its other entries. 


For example, see the section on Gaussian elimination: 


о с н 
о о 
— о o 
шо N = 
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18.11 Worked Examples 


18.11.1 Matrix Inversion 


Invert A and show that АА”! = L. 


A= 2 
24 
Solution 
Using: 
a 1 а —b 
^ det A erg 
then det A — 2 and: 
тесе 1 4 —5 
2| 2 3 
Calculating ААТ ': 
1 - 1 
re asd 3 5 4 5 _1 
2124 ao in 2 
18.11.2 Identity Matrix 
Invert A and show that АА”! = Is. 
234 
А=| 121 
567 


Solution 
Using Sarrus’s rule for det A: 


det А = 28 + 15 + 24 — 40 — 12 


21 = 


349 
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therefore: 
2- 1..5 
AT=| 3 2 6 
417 
i (14-6) —(21- 24) (3 — 8) 
A'--c|-0-9 (14-29 -Q-4 
(6-10) —(12- 15) (4 — 3) 
8 3 —5 
1 
=-5| -2 -6 2 
-4 3 1 
and 
| 23 4 8 3 —5 
AA --< 12 qe ee 
5.6 7 —4 3 1 
-6 0 
= 1 
= 6 —6 
0 —6 
100 
=|] 0 1 0 
0 0 1 


18.11.3 Solving Two Equations Using Matrices 


Solve the following equations using matrices. 


20 = 2x + 3y 
36 = 7х + 2y 
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Solution 
Let: 
A= 2 
2 
therefore: det A = —17, and: 
Acc. ЕЯ 2 —3 
17| 7 2 
therefore: 
x Wits es 1 20 
y 17 36 
Zn 1 — 108 
17 | 1404+72 
EE 
CUYTI 
4 


therefore: x — y — 4. Wecan prove these values by making sure that they satisfy the 
original equations as follows: 


20 = 2x + 3y 
-2х4--3х4 
= 20 

36 = 7x + 2y 
=7х4+2х4 
= 36 


18.11.4 Solving Three Equations Using Matrices 


Solve the following equations using matrices. 
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10-2х--у-2 
13--х-у--2 
28--х--2у--2 


Solution 
Let: 


Using Sarrus’s rule for det A: 


det A = —2—14+2+1-44+1=-3 


therefore: 
2 —1 -1 
А=| 1 -1 2 
—] 1 1 
"RB c Lo x 
A'"--4|-c1«D0 (02-1) -@-0 
СИЕ. eae Do 241) 
—3 -3 0 
_ 1 
=-3 (0) 1 -1 
-3 -5 -l 
therefore: 
x ў -3 -3 0 10 
y 2 0 1 -1 13 
7 -2 -5 -1 28 
—30 — 39 
LE 13 — 28 
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—69 
--і| —15 
—123 
23 
=| 5 
41 


therefore: x = 23, у= 5, z = 41. We сап prove these values by making sure that 
they satisfy the original equations as follows: 


10=2x+y-z 
=2х23 +5 – 41 
= 10 

1ІЗ--х-у--2 
--23-5--41 
- 12 

28--х--2у--2 
--23--2х5--41 
= 28 


18.11.5 Solving Two Complex Equations 


Solve the following complex equations using matrices. 
7 + 81 = 2x + y 
-4-і-х-2у 


Solution 
Let: 
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therefore: det A = —5, and: 


АТ — 2- 4 
1—2 
де В —2 —1 
5| -1 2 
therefore: 
x __1 —2 —1 7+ 81 
y 5| 1 2 der 


1| -14— 16i 4i 
M m mS 


—. 1| -10- 15i 
5 | —15— 10i 

EE 
342i 


therefore: x = 2 + 31, у = 3 + 21. We can prove these values by making sure that 
they satisfy the original equations as follows: 


74+ 81 = 2х у 
=2х (2 + 3i) + 3+ 21) 
=4+6 +3 + 21 
= 7 + 81 
—4-i=x-2y 
=2+ 31-2 х (3 + 21) 
-2--31-6-ді 
--4-і 


18.11.6 Solving Three Complex Equations 


Solve the following complex equations using matrices. 
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Solution 
Let: 


(25-1) -(-24-1) 0 
AN = 3] —(—4+1) (-2-1) -0+2 


(2-1) -(1--) (-1-2) 
therefore: 
x 1 1 0 0 
y = 3 3 -3 -3 3 + 31 
2 1 2 3 5—51 
1 3 + 31 
EL —9 — 9i + 15 + 151 
—6 — 6i 4- 154 151 
1+1 
= |2 +21 
3 + 31 


егеѓоге: х = 1+1, у=2 +21, z= 3 + 31. Ме сап prove these values by mak- 
ing sure that they satisfy the original equations as follows: 
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O=x+y-z 

= (1 +i) + (2 + 2i) — (3+ 3i) 
=3+4+31-3-31 

=0 


34+31=2x-—y+z 
=2х (1+1 - (2+2 + (3 + 31) 
—2-2i—2-—2i- 3 + 31 
=3 +31 
5-5і--х--у- 22 
--(1-0)--(2--21)-2х(3--31) 
1-1-6-бі 
= —5 – 51 


18.11.7 Solving Two Complex Equations 


Solve the following complex equations using matrices. 


3+ 5i=xi+2y 
5 +1= 3х – у 


Solution 
Let: 
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therefore: 

x 1 -і —2 3+ 51 

y 5| -3 i 5+i 


21| -%%5-10-2і 
5 | —9— 15i45i—1 


. 1| -5-зі 
5 | —10-— 10i 
ШЕР? 
2+2ї 


therefore: x = 1 +i, у = 2 + 21. We can prove these values by making sure that 
they satisfy the original equations as follows: 


345i — x14 Dy 
іх(1+0 +2 х (2+ 21) 


= —-1+14+4 +4 
= 3 + 51 
5 +1= 3х — yi 
=3 x (1+1 – 10+ 21) 
= 3 +31 21+ 2 
=5 +і 


18.11.8 Solving Three Complex Equations 


Solve the following complex equations using matrices. 


6 +2i = xi+2y — zi 
—2 + 6i = 2x — yi + 2zi 
2-- 10i = 2xi + yi 4-2z 
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Solution 
Let: 
i 2 -1 
А=| 2 -i 2i 
2i i 2 


therefore: det A= 2 — 8 + 2 + 21 + 2i — 8 = —12 + 4i, and 


i 2 2i 
АТ=| 2 -i i 

EON 
(214-2) -(4-І) (азы) 
-4-4  Qi-2 -(-2+2i) 


oe 
—12 + di 
(0—2) ecd cd). (1—4) 

| 2—5 = dq 
= cg. —2+2i 2-2 


© 124 4j 
—2--2i 1+4 -3 


therefore: 
1--41 6--21 


2—2 —2+ 61 
-3 2 -F 10i 


-3 
-2--21 
=2+21 1-4 
1 (2 — 21) (6 + 12) — 3(—2 + 61) + (1 + 41) 2 + 101) 
—8(6 + 21) + (—2 + 21) (—2 + 61) + (2 — 21)(2 + 101) 


(—2 + 21)(6 + 2i) + (1 + 41)(—2 + 61) — 3(2 + 101) 
12--41- 1213-4 4- 6 — 181+2+4 101 + 81 — 40 
12 + 4 + 20i — 41 + 20 


1 
= рр | ~48-16i +4 - 12i- 4i- 
—12-4+14—4—2+6—8—24—6—301 
—16 — 81 
— І . 
Spa | 732- 16i 


—48 — 24i 
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multiply by the conjugate of —12 + 41: 


X 


y 


therefore: 


= 


—16 — 81 
—12 — 4i 
= ^ 160 _ —32 — 16i 
—48 — 24i 
| (—12 — 4i)(—16 — 8i) 
160 


1 
— (192 + 641 + 96i — 32 
165092 + 64i + 96i — 32) 


1 
—— (160 + 1601) 


160 
1+1 
1 

12-41 2- 161 
160‘ i)(—3 бі) 
1 
—— (384 + 1281 4- 1921 — 64 
160° + i+ 1 ) 
1 
— (320 + 3201 
160“ + 3201) 
2+ 21 
1 

12-41 48-241 
160 1)( 1) 
1 
— (57 1921 + 2881 — 
1609 6+ 1921 + 2881 — 96) 
1 
— (480 + 4801 
160 + 4801) 
3+31 


егеѓоге: х =1+i, y=2+2i, z= 3 31. Ме сап prove these values by mak- 
ing sure that they satisfy the original equations as follows: 


6 4-21 = xi+2y — zi 


= i(1 +i) + 2(2 + 21) — i(3 + 31) 
—i—1-4444i—3i 43 
—64 2i 


—2 + 6i = 2x — yi + 221 
= 2(1 +i) —i(2 + 2i) + 2i(3 + 3i) 
—2-F2i—2i42--6i—6 


2+ 6i 
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2+ 10i = 2xi+ yi+2z 
= 2i(1 + i) + i(2 + 2i) +26 + 31) 
-21-2-421-2--6-4-бі 
=2+4 10i 


18.11.9 Augmented Matrix 


Make the following matrix into an augmented matrix. 


x | | 2 4 12 
y 4 3 16 
Solution 
x | | 2 4412 
y 4 3] 16 


18.11.10 Gaussian Elimination 


Solve the system of linear equations using Gaussian elimination: 


2x + 2y + 2z = 30 
-у-22--і7 
х-2у-22--26 


Solution 
The solution terminates when the augmented matrix contains a RREF. 
Starting with the following augmented matrix. 


x 22 |30 
у|=| 0 —-1 —2|-17 
z 1 0 |296 


Place а 1 in column 1 апа Os column 2 and 3 of the first row. 
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Row, + Row; = Row; 


x 0 0 4 
y |= 0 -І —2|—17 
2 1 2 2 26 


Place a | in column 2 of the second row by reversing the sign of the second row. 


—Row?2 => Row? 


x 1 0 0 4 
y |= 0 1 17 
2 1 2 2; 26 


Place a 1 in column 3 and a 0 in column 2 of the third row. 


Row; + 0.5Коууҙ => Rows; 


x 1 0 0/4 
3- Бе 0 12117 
2 —0.5 0114 


Place а 0 in column lof the third row. 


0.5Row, + Row; => Rows; 


X 10014 
y |=] 0121417 
2 0 0 6 
Place a 0 in column 3 of the second row. 
Row» — 2Row3 = Row» 
x 10 4 
у |=] 0 1 075 
2 0 0 6 
x=4 
y=5 


2-6 


Chapter 19 A) 
Geometric Matrix Transforms сеек for 


19.1 Introduction 


In this chapter, we build upon the ideas of matrices described in the previous chapter, 
and provide a coherent framework for describing transforms in two and three dimen- 
sions. 

Geometric matrix transforms are an intuitive way of defining and building 
geometric operations such as scale, translate, reflect, shear and rotate. In 2D, such 
operations are generally associated with images and text, and widely used in internet 
browsers, image-processing software, smart phones and watches. In 3D, they are 
used in computer games, computer animation, film special effects, virtual reality 
and scientific visualisation. They have proved so useful that they are incorporated in 
hardware to provide the highest possible execution speeds and real-time performance. 


19.2 2D Transform 


The general 2D transform is: 
y =cx+dy (19.1) 
x^ [jab||x 
yj [а] [у 
where the values of a, b, с and d determine the type of transform. Let’s examine 


2D transforms and generalise their application to 3D, and start with the translate 
transform, as this reveals a fundamental problem with matrices. 


or in matrix form: 


© Springer Nature Switzerland AG 2024 363 
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19.2.1 2D Translation 


A 2D translation transform is described by: 


x =x4+h 


y-y-ct, 
where the point (x, y) is translated by (1,, ty). Modifying (19.1), this becomes: 


x’ =axt+byt+t, 
y =cex+dy+t, 


where а = d = 1, and b = с = 0. However, this does not appear to have a single 
matrix representation, due to the addition of 1, and t,. Fortunately, homogeneous 
coordinates come to the rescue, and support any type of transform incorporating 
addition or subtraction. The idea is to solve a 2D problem in 3D, where any point 
(x, y) becomes (zx, zy, z), where the latter is called a set of homogeneous coordi- 
nates for (x, у). As z = 1 in this case, the homogeneous coordinates аге (x, у, 1). 
Rewriting (19.1) in 3D, we have: 


х =ax+byth 
у = сх + ау +} 


1= 0х +0у +1 

ог in matrix form: 

х! abt, x 

у {= | сау у 

1 001 1 
The 2D translation transform is: 

х! 101, x 

y |=] 01% y 

1 00 1 1 


Figure 19.1 shows a shape translated by (5, 2) using this matrix: 


х! 105 х 
у [= [012 y 
1 001 1 


As we аге only interested іп (x’, y’), the z-coordinate is ignored. 
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Fig. 19.1 Тһе blue shape is y 
the translated yellow shape 


к 


19.2.2 2D Scaling 


2D scaling is achieved using: 


X = 5х 
y = зуу 
or as a homogeneous matrix: 
x" sy 00 x 
у | =] 050 y 
1 001 1 


The homogeneous form is maintained as we often have to combine different matrices. 
Figure 19.2 shows the effect of scaling a shape by times 2 horizontally, and times 
1.5 vertically. 


Fig. 19.2 Asymmetric y 
scaling relative to the origin 


к 
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The scaling action is relative to the origin, i.e. the point (0, 0) remains unchanged. 
All other points move away from the origin when the scale factor exceeds 1, or 
towards the origin when it is less than 1. To scale relative to another point (px, ру) 
we first subtract (px, py) from (x, y). This effectively makes the reference point 
(px, Py) the new origin. Second, we perform the scaling operation relative to the 
new origin, and third, add (px, py) back to the new (x, у) to compensate for the 
original subtraction. Algebraically this is: 


x! = 5 (х — px) + Px 
y =sy(y — py) + Py 


which simplifies to: 


x! = sx py(1 — s) 
y = syy + py(1 — sy) 


or as a homogeneous matrix: 


x 5, 0 p(l — sx) х 
у |} =] 0 sy p(l- sy) у (19.2) 
1 00 1 1 


Figure 19.3 shows a scale of times 2 horizontally, and times 1.5 vertically relative 
to the point (—3, 0) using this matrix: 


! 203 
y |=] 01.50 y 
1 001 1 


Fig. 19.3 Asymmetric 
scaling relative to (—3, 0) 
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19.2.3 2D Reflection 


A 2D reflection transform about the y-axis is: 


at —100 x 
y |=] 010 y (19.3) 
1 001 1 

or about the x-axis: 
x 100 X 
y [= | 0-10 y (19.4) 
1 00 1 1 


where (19.3) reverses the sign of the x-coordinate, and (19.4) reverses the sign of the 
y-coordinate. However, to make a reflection about an arbitrary vertical or horizontal 
axis we need to introduce some more algebraic deception. 

To make a reflection about a vertical axis x = ах, we first subtract а, from the x- 
coordinate. This effectively makes the line x = a, coincident with the major y-axis. 
Next, we perform the reflection by reversing the sign of the modified x-coordinate, 
and finally, we add a, to the reflected coordinate to compensate for the original 
subtraction. Algebraically, the three steps are: 


Хі-х-а, 


хо = —(x — ay) 
ж = —(x — ax) + a 
which simplifies to: 
х = —x + 2a, 
= у 
ог as a homogeneous matrix: 
x’ —] 0 2a, x 
y l=] 010 y (19.5) 
1 001 1 


Figure 19.4 shows а shape reflected about the line x = 3 using (19.6): 


x! -106][x 
y|=|010]] у (19.6) 
1 001 | |1 
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Fig. 19.4 Reflecting a shape 
about the line x = 3 


To reflect a point about the line у = ay, the following transform is required: 


x =x 
y = -(у -ау)--а, 
=-у+2а, 
ог as a homogeneous matrix: 
х! 10 0 x 
У | = | 0-1 2a, y 
1 00 1 1 


19.2.4 2D Shearing 


A 2D shear parallel to the x-axis is achieved by adding a portion of the y-coordinate 
to the x-coordinate, and the y-coordinate remains unchanged: 


x’ =x+ay 
yoy 
or as a homogeneous matrix: 
х! 120 x 
y |=]010 y 
1 001 1 


A shear parallel to the y-axis is achieved by adding a portion of the x-coordinate to 
the y-coordinate, and the x-coordinate remains unchanged: 
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Fig. 19.5 Shearing a shape 
by В 


х'=х 


у = у+Ах 
ог as a homogeneous matrix: 
х! 100 х 
y |= [А10 y 
1 001 1 


If a specific angle is required, let А = tan В. Figure 19.5 shows shearing relative 
to the x-axis by an angle В: 


x’ = х + уп В 


y=y 
or as a homogeneous matrix: 
x 1 tan В 0 x 
у|-|01 0 y 
1 0 0 1 1 


19.2.5 2D Rotation 


Very often we need a 2D rotation transform to rotate a shape or an image about 
the origin, or some other point, and in this section we investigate the underlying 
geometry. Figure 19.6 shows a point P(x, y) rotated by an angle В about the origin 
to Р(х, у’). From the figure: 
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Fig. 19.6 Rotating a point 
through an angle В , Р(х’ у’ 


> 
X 
x’ = r cos(0 + В) 
y —rsin(0 + В) 
х 
— = с050 
F 
ae sin Ө 
ғ 
and substituting the identities for cos(0 + 8) апа sin(@ + В) we have: 
x’ = r(cos0 cos В — sin@ sin f) 
у’ = r(sin6 cos В + cos0 sin В) 
/ x S Y 
X =r (= cosg — Z sing) 
r r 
/ y X o. 
y =r (Ž cosg + - sing) 
r r 
x' = x cos В — ysin В 
y' = xsin В + y cos В 
ог as a homogeneous matrix: 
x! cos В — sin В 0 x 
y |=] sinf cosg 0 y (19.7) 
1 0 0 1 1 


and is the general transform for rotating a point about the origin. 
Figure 19.7 shows the effect of rotating the yellow arrow by 90° using this matrix: 


х” 


0-10 
у! = 100 
1 001 


mS ы 
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Fig. 19.7 The yellow shape 
is rotated 90° 


When В = 360° the matrix becomes the identity matrix, and has a null effect: 


x! 100||x 
y [= [010 у 
1 001 1 


To rotate a point (x, y) about an arbitrary point (р;, ру) we first, subtract 
(Px, Py) from (x, y), which enables us to perform the rotation about the origin. 
Second, we perform the rotation, and third, we add (р,, ру) to compensate for the 
original subtraction. Here are the steps: 

1. Subtract (px, py): 


Xp =X — Px 


у= у = Py 
2. Rotate В about the origin: 


X2 = ху cos В — y, sin В 


yo = xı sin В + yı cos В 
3. Add (px, ру): 


x' = x, cos В — y, sin B + p, 
у = xı sin B + yi cos В + py 


Simplifying: 


x’ = x cos B — y sin В + р»(1 — cos В) + py sin В 
у = xsin $ + y cos B + p,(1 — cos В) — р, sin В 
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and as a homogeneous matrix: 


х! cos B – sin В р; (1 — cos В) + py sin В x 
y | =] sinf соѕВ р,(1-совВ)- p, sin В y (19.8) 
1 0 0 1 1 


To rotate a point 90° about the point (1, 1) (19.8) becomes: 


/ 


1 


х 0-12 

у! = 100 y 

1 001 1 

A simple testis to substitute the point (2, 1) for (х, у), whichis transformed correctly 
to (1, 2). 

The algebraic approach in deriving the above transforms is relatively easy. How- 
ever, it is also possible to use matrices to derive compound transforms, such as a 
reflection relative to an arbitrary line and scaling and rotation relative to an arbi- 
trary point. These transforms are called affine, as parallel lines remain parallel after 
being transformed. Furthermore, the word ‘affine’ is used to imply that there is a 
strong geometric affinity between the original and transformed shape. One can not 
always guarantee that angles and lengths are preserved, as the scaling transform can 
alter these when different x and y scaling factors are used. For completeness, these 
transforms are repeated from a matrix perspective. 


19.2.6 2D Scaling Relative to a Point 


The strategy used to scale a point (x, y) relative to some arbitrary point (px, ру) 
is to first, translate (— px, —py); second, perform the scaling; and third translate 
(px, Py). These three transforms are represented in matrix form as follows: 


/ 


f 


у |= [ translate(p,, ру) | [ scale(s,, Sy) | | ranslate(— px, —py) ] 
1 


eS ы 


which expands to: 


х! 10р, s, 00 10 -px x 
У |=] 01 py 05,0 01-р, y 
1 00 1 001 00 1 1 


Note the sequence of the transforms, as this often causes confusion. The first trans- 
form acting on the point (x, y, 1) is translate (-р,, -ру), followed by scale 
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(sx, Sy), followed by translate (p,, ру). If they are placed in any other sequence, 
you will discover, like Gauss, that transforms are not generally commutative! 

Now we concatenate these matrices into a single matrix by multiplying them 
together. This can be done in any sequence, so long as we preserve the original order. 
Let's start with scale (s,, sy) and translate (-р,, — ру) matrices. This produces: 


х! 10 Px Sx 0 —Sx Px X 
у |= [01р | | 0 sy —sypy || y 
1 00 1 00 1 1 
and finally: 
x Sx 0 py — 5) x 
y |=] 0 Sy Py(1 — sy) y 
1 0 0 1 1 


which is the same as the previous transform (19.2). 


19.2.7 2D Reflection Relative to a Point 


A reflection about the y-axis is given by: 


х! -100 x 
y |=] 010 y 
1 001 1 


Therefore, using matrices, we can reason that a reflection transform about an arbitrary 
line x = ay, parallel with the y-axis, is given by: 
х! х 
у |= [ translate(a,, 0) | [ reflection | [ translate(—a,, 0) | y 
1 1 


which expands to: 


x! 10a, -100 10-а, x 
y |=]010 010 01 0 y 
1 00 1 001 00 1 1 


Now we concatenate these matrices into a single matrix by multiplying them together. 
Let’s begin by multiplying the reflection and the translate (—a,, 0) matrices together. 
This produces: 
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x' 10a, -І 0а, x 
У [= 1010 010 у 
1 001 001 1 
and finally: 
x! —1 02a, x 
y |=] 010 y 
1 00 1 1 
which is the same as the previous transform (19.5). 
19.2.8 2D Rotation Relative to a Point 
A rotation matrix about the origin is given by: 
x' cos B — sin 0 x 
y | =| sing cosg 0 y 
1 0 0 1| 1 


Therefore, using matrices, we can develop a rotation relative to a point (ру, ру) 
as follows: 


x x 
у | = [ translate( Px, Py) | [ rotate B | | translate(— рх, —Py) | y 
1 1 


which expands to: 


a 10 р. cos В — sin В 0 10-р, x 
У |=] Ol p ѕіп В cosg 0 01—p, y 
1 00 1 0 0 1 00 1 1 


Now we concatenate these matrices into a single matrix by multiplying them together. 
Let’s begin by multiplying the rotate 6 and the translate (-р,, -ру) matrices 
together. This produces: 


х! 10р, cos В — sin В — p, cos В + p, sin В x 
y |= | 01р, sinf cosB —p, sin В — p, cos В y 
1 00 I 0 0 1 1 
x cos В — sin В р,(1- cos В) + py sin В x 
y | = | sinf cosB p,(l— cos) — р, ма В y 
1 0 0 1 1 


which is the same as the previous transform (19.8). 
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Ihope it is now clear to the reader that one can derive all sorts of transforms either 
algebraically, or by using matrices—it is just a question of convenience. 


19.3 3D Transforms 


Now we come to 3D transforms, where we apply the same reasoning as in two 
dimensions. However, translation remains a problem, unless we move the prob- 
lem to a four-dimensional homogeneous space, which means turning (x, y, z) into 
(wx, wy, wz, w), with w = 1. Scaling and translation are basically the same, but 
in 2D, where we rotated a shape about a point, in 3D, we rotate an object about an 
axis. 


19.3.1 3D Translation 


The algebra is so simple for 3D translation, that we can write the homogeneous 
matrix directly: 

| 1001, 
^| [0105 


^| |0011, 
1 0001 


o ы 
— CS M 


19.3.2 3D Scaling 


The algebra for 3D scaling is: 


х = SX 
$ 

у = Syy 
25550 


апа as а homogeneous matrix: 


5,000 
05,00 
005.0 
0001 


х! 
у! 
z 

1 


— ANS ж 


The scaling is relative to the origin, but we can arrange for it to be relative to an 
arbitrary point (ру, Py, р.) using the following algebra: 
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№ = Sx x = Px) + Px 
y = sy(y — py) + Py 
z —s(z— Dz) + Pz 


and as a homogeneous matrix: 


Sx 0 0 p(l — sx) x 
05, 0 py — sy) y 
2 
1 


0 0 s; р. Ц — sz) 


х! 
у! 
z 
1 000 1 


19.3.3 3D Reflection 


3D Reflections occur with respect to a plane, rather than an axis. The homogeneous 
matrix giving the reflection relative to the yz-plane is: 


x! —1000 x 
y |_ |0100 у 
z' 0 010 2 
1 0 001 1 


and the reflection relative to a plane parallel to, and a, units from the yz-plane is: 


х! —1 0 0 2a, х 
y| | 010 0 y 
z| | 001 0 2 
1 0001 1 


19.3.4 3D Rotation 


In two dimensions a shape is rotated about a point, whether it be the origin or some 
other position. In three dimensions an object is rotated about an axis, whether it be 
the x-, y-, z-axis, or some arbitrary axis. To begin with, let's look at rotating a point 
about one of the three orthogonal axes; such rotations are called Euler rotations 
after Leonhard Euler. 

Recall that a general 2D rotation transform is given by: 


x' cos B — sin B 0 x 
y |=] sing cosg 0 y 
1 0 0 1 1 
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Fig. 19.8 Rotating the point y 
P about the z-axis 1 


which in 3D can be visualised as rotating a point P(x, у, z) ona plane parallel with 
the xy-plane as shown in Fig. 19.8. In algebraic terms this is written as: 


x' = x cos В — y sin В 
y' = xsin В + y cos В 
gg 


and as a homogeneous matrix: 


x! cos B —sinB 00 x 
У | | sing cosg 00 y 
zl 0 0 10||: 
1 0 0 01 1 


which rotates а point about the z-axis. 


When rotating about the x-axis, the x-coordinates remain constant whilst the y- 
and z-coordinates are changed. Algebraically, this is: 


х= х 
у’ = ycos B — z sin 
2 = ysin В + z cos В 


апа as a homogeneous matrix: 


х! 1 0 0 0 x 
y | | Ocosf —sinB 0 y 
z 0 sinf cosg 0 2 
1 0 0 0 1 1 


When rotating about the y-axis, the y-coordinate remains constant whilst the x- 
and z-coordinates are changed. Algebraically, this is: 
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x = zsin В + x cos В 
y —y 
< = 2с05 8 — хѕір В 


апа as a homogeneous matrix: 


x cos B O sin 0 x 
yl. 0 1 0 0 y 
z — sin 8 0 cos В 0 4 
1 0 0 0 1 1 


Note that the matrix terms do not appear to share the symmetry seen in the previous 
two matrices. Nothing really has gone wrong, it is just the way the axes are paired 
together to rotate the coordinates. 

The above rotations are also known as yaw, pitch and roll, and great care should 
be taken with these angles when referring to other books and technical papers. Some- 
times a left-handed system of axes is used rather than a right-handed set, and the 
vertical axis may be the y-axis or the z-axis. Consequently, the matrices represent- 
ing the rotations can vary greatly. In this book, all Cartesian coordinate systems are 
right-handed, and the vertical axis is generally the y-axis. 

The roll, pitch and yaw angles are defined as follows: 

e roll is the angle of rotation about the z-axis. 

e pitch is the angle of rotation about the x-axis. 

e yaw is the angle of rotation about the y-axis. 


Figure 19.9 illustrates these rotations and the sign convention. The homogeneous 
matrices representing these rotations are as follows: 


Fig. 19.9 Yaw, pitch and roll 
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e rotate roll about the z-axis: 


cosroll —sinroll 0 0 
sinroll cosroll 00 


0 0 10 
0 0 01 
e rotate pitch about the x-axis: 
1 0 0 0 


0 cos pitch — sin pitch 0 
0 sin pitch cos pitch 0 
0 0 0 1 


e rotate yaw about the y-axis: 
cos yaw 0 sin yaw 0 
0 1 0 0 
— sin yaw 0 cos yaw 0 


0 0 0 1 


A common sequence for applying these rotations is roll, pitch, yaw, as seen in 
the following transform: 


"E [yaw] [ pitch | (ғой | 


OV ы 
RN c2 ж 


and if a translation is involved: 


А 


х 
4 = [ translate | [yaw] | pitch] [roll] 


— м м x 


к- 


19.3.5 3D Rotation About ап Axis 


The above rotations are relative to the x-, y-, z-axis. Now let’s consider 3D rotations 
about an axis parallel to one of these axes. To begin with, we will rotate about an 
axis parallel with the z-axis, as shown in Fig. 19.10. The scenario is very reminiscent 
of the 2D case for rotating a point about an arbitrary point, and the general transform 
is given by: 
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Fig. 19.10 Rotating the 
point P about an arbitrary 


axis Р(х,у, 2) 


эче 


= | translate( рх, Py,9) | [ rotate B | | translate(— Px, —Py,9) | 


en x 


x 
y 
€ 
1 


and the matrix is: 


x cos В — sin B 0 p.(1 — cos B) + py sin В x 
y | | sing cosg 0 p,(1 — cos В) — p, sin f y 
zl 0 0 1] 0 2 
1 0 0 0 1 1 


Hopefully, you can see the similarity between rotating in 3D and 2D: the x- and 
y-coordinates are updated while the z-coordinate is held constant. We can now state 
the other two matrices for rotating about an axis parallel with the x-axis and parallel 
with the y-axis: 


e rotating about an axis parallel with the x-axis: 


x 1 0 0 0 x 
y | | 0 соѕ В —sinB p,(1 — cos В) + р. sin В y 
A О ѕіп В cosB p(l -— cos В) — py sin В 2 
1 0 0 0 1 1 
e rotating about an axis parallel with the y-axis: 
х! cos В O sinf р. (1 — cos В) — р; sin В х 
У | _ 0 1 0 0 y 
z | | —sin£ 0 cos В р„(1 — cos В) + p, sin В 2 
1 0 0 0 1 1 
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19.3.6 3D Rotation About an Arbitrary Axis 


19.3.7 3D General Rotation Matrix 


Rotating a point about an arbitrary axis is achieved in various ways. We can 
employ vectors, analytic geometry, matrices or quaternions. In this example, vectors 
are used, and Fig. 19.11 shows a view of the geometry associated with the task 
at hand. For clarification, Fig. 19.12 shows a cross-section and a plan view of the 
geometry. 

The axis of rotation is given by the unit vector: 


fi = ai + bj + ck 


P(xp, yp, Zp) is the point to be rotated by angle a to Pup. УЬ» ©). O is the origin, 
whilst p and p’ are position vectors for P and P’ respectively. From Figs. 19.11 and 
19.12: u—— 

р =ON+NQ+QP 


Fig. 19.11 The geometry 
associated with rotating a 
point about an arbitrary axis 


Fig. 19.12 А cross-section А, 
and plan view of the 
geometry 
мг Р 
9——————9 
n р 
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— 
To find ON: 
| n|| = Ірі cos 8 
fi- p = Ірі cos 0 
In] =n-p 
п = fi|[n|| 
n= fi(fi- p) 
therefore: 
— » x 
ON = п = n(n - p) 
— 
To find N Q: 
N N 
NÓ = мо Q r = cosa r 
NP МР! 
but: 
р= п +г= (№ :р) +г 
therefore: 
r = p — f(fi- p) 
and 
— » X 
МО = [р-п@ . p)]coso 
—À 
To find ОР” 
Let: 
nxp=w 
where: 
ІМІ = р sin Ө = |р| sin8 
but: 
\г| = р sine 
therefore: 
Iwill = ||г|| 
Now: 
Qm ӨР” QP _. 
= = = sina 
NP lr ll Iwl 
therefore: 


/ Я A . 
ОР = wsino = (nx р) sina 
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then: 

p = (00 - p) + [p — â - p)] cosa + (ñ х р) sina 
and 

р = pcosa + й(й -р)(1 — cosa) + (ћ х р) sina. 
Let: 

К = 1 — cosg 
then: 
р’ = pcosa+n(fi- p)K + (i x p) sing 

and 


р = (xpi + ур] + К) cosa + (ai + bj + ck)(axp + Бур + с) К 
+ [(bzp — cyp)i + (схр — azp)j + (ayp — bxp)k] sina 

р’ = [хр cosa + а(ахр + Бур +czp)K + (bz, — сур) ѕіпо]і 

[yp cosa + b(axp + Бур + cZp)K + (exp — azp) Sina]j 

[zp cosa + c(axp + Бур + czp) K + (ayp — Бх) sin a]k 

р’ = [х,(а2К + созо) + y,(abK — csing) + 2.(асК + bsina)]i 
+ [xp(abK + csing) + yp (b? K + cos a) + zp(bcK — asina)]j 
+ [xp(acK — bsina) + у, (bcK + asina) + zac + cosa)]k 


and the transform is: 


X, a?K +cosa abK —csina acK +bsina 0 Xp 
у, | _ | abK +csina DK +cosa bcK — asina 0 Yp 
А ~ | acK —bsina bcK +asina c?K + соѕа 0 Zp 
1 0 0 0 1 1 


where: 
К = 1 — cosg. 


The Worked Examples at the end of this chapter illustrate how this matrix is used. 
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The determinant of the transform (19.9) is ad — bc: 


И Е ЕМ В (19.9) 
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Fig. 19.13 Тһе inner 
parallelogram is the 
transformed unit square 


> (a+b, c+d) 


(0, 0)” 


If we subject the vertices of a unit-square to this transform, we create the situation 
shown in Fig. 19.13. The vertices of the unit-square are transformed as follows: 


(0, 0) = (0, 0) 
(1, 0) -> (а, с) 
(1, 1) 2 (ac b,c4 d) 
(0, 1) 2 (b, d) 


From Fig. 19.13 it can be seen that the area of the transformed unit-square A is given 
by: 


area = (a--b)(c--d) -B-C—-D-E-F—G 
bd 
= ас РА +bc+bd- -5 -bc ------ bc — — 
= ad — bc 


which is the determinant of the transform. But as the area of the original unit-square 
is 1, the determinant of the transform controls the scaling factor applied to the trans- 
formed shape. 

Let's examine the determinants of two transforms: The first 2D transform encodes 
a scaling of 2, and results in an overall area scaling of 4: 


EH 


|. 


whose determinant is: 
Ta 


The second 2D transform encodes a scaling of 3 and a translation of (3, 3), and 
results in an overall area scaling of 9: 
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303 
033 
001 
whose determinant is: 
33 03 03 
«1| l1] **|5s] 7? 


These two examples demonstrate the extra role played by the elements of a matrix. 


19.5 Perspective Projection 


In any 3D computer graphic application a database stores a collection of virtual 
objects in the form of Cartesian coordinates, or other permitted formats. A virtual 
camera is then located within this world space with position and direction using 
a compound transform T,. To capture a perspective view, each point (x, y, z) is 
transformed to the camera’s coordinate system (хс, Ус, Zc) using the inverse of the 
compound transform T; ! . 

A virtual camera is directed along its z-axis as shown in Fig. 19.14. Positioned 
d units along the z-axis is a virtual projection screen, which is used to capture the 
perspective projection. Figure 19.15 shows that any point (хе, Ус, Ze) is transformed 
to (Xp, Ур. d). It also shows that the screen’s x-axis is pointing in the opposite 
direction to the camera's x-axis, which can be compensated for by reversing the sign 
of x, when it is computed. 

Figure 19.15 shows a plan view of the scenario depicted in Fig. 19.14, and Fig. 
19.16 a side view, which permits us to inspect the geometry and make the following 
Observations: 


Fig. 19.14 The axial system Y, o (Xo Yo Ze) 
used for the perspective VA 
projection ү, 
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Fig. 19.15 А plan view of Xe screen 
the camera’s axial system 


(Xe, Ус Ze) 


(Or Yo d) 7 


Ze 


Fig. 19.16 A side view of Y. Screen 


the camera's axial system 
(Xe, Ve, zc) 
E] 


Ze 


Ze 


Me SITUE 
Ze d 
- 
Жасша 
PU Zefa 
апа 
Ус p 
Ze d 
y=} 
i гг/а 
This is expressed іп matrix form as: 
Xp —10 0 0 Xe 
yp} |0100 Ус 
| |0010 22 
w 0 01/40 1 
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At first, the transform seems strange, but if we multiply this out we get: 


T T 
[Xp Ур Zp w] =[—х ye ze 5/4] 
and if we remember the idea behind homogeneous coordinates, we must divide the 
terms хр, yp, Zp by w to get the scaled terms, which produces: 


х, 
x. = 
Р гг/а 
ys Ус 
P zold 
Ze 
= == а 
is гг/а 


which, after all, is rather elegant. The value of 4 controls the size of the image, and 
acts like a zoom control. Notice that this transform takes into account the sign change 
that occurs with the x-coordinate. Some books will leave this sign reversal until the 
mapping is made to the hardware display coordinates. 


19.6 Worked Examples 


19.6.1 2D Scale and Translate 


Т; and Т» translate and scale a 2D point (x, y) to (x' , у’). Concatenate them in two 
possible ways and show that the point (1, 1) is transformed to two different places. 


Solution 
! 200 X 
Т = | у | = |020 y 
1 001 1 
á 102 x 
Т- |у” | = | 012 y 
1 001 1 
/ 200 102 x 
Т.Т = | у | =| 020 012 y 
1 001 001 1 
204 x 
= | 024 y 
001 1 


and the point (1, 1) is transformed to (6, 6). 
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x! 102 200|1|х 
ТТ = | у | = |012 | [020 у 
1 001 001 1 

202 | [х 

= |022 y 

001 1 


and the point (1, 1) is transformed to (4, 4). 


19.6.2 2D Rotation 


Compute the coordinates of the unit square in Table 19.1 after a rotation of 90°. 


Solution 
The points are rotated as follows: 


x cos В — sin В 0 x 

y |=] sing cosg 0 y 

1 0 0 1 1 
0-10 x 
001 1 
0 0-10 0 
0|-|100 0 
1 001 1 
0 0-10 1 
1|-|100 0 
1 001 1 
-1 0-10 1 
1|-|100 1 
1 001 1 


Table 19.1 Original and rotated coordinates of the unit square 
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19.6.3 Determinant of the Rotate Transform 


Using determinants, show that the rotate transform preserves area. 


Solution 

The determinant of a 2D matrix transform reflects the area change produced by 
the transform. Therefore, if area is preserved, the determinant must equal 1. Using 
Sarrus’s rule: 


cos В — sin 8 0 
ѕіп В cosB 0 || = cos? 8 + sin? 8 = 1 
0 0 1 


which confirms the role of the determinant. 


19.6.4 Determinant of the Shear Transform 


Using determinants, show that the shear transform preserves area. 


Solution 

The determinant of a 2D matrix transform reflects the area change produced by 
the transform. Therefore, if area is preserved, the determinant must equal 1. Using 
Sarrus’s rule: 


1 tan 8 0 
0 1 0||-і 
0 0 1 


which confirms the role of the determinant. 


19.6.5 Yaw, Pitch and Roll Transforms 


Using the yaw and pitch transforms in the sequence yaw x pitch, compute how the 
point (1, 1, 1) is transformed with yaw = pitch = 90°. 


390 19 Geometric Matrix Transforms 


Solution 
К cos yaw 0 sin yaw 0 1 0 0 0 x 
ry 0 1 о 0 0 cos pitch — sin pitch 0 y 
A ~ | — sin yaw 0 cos yaw 0 0 sin pitch cos pitch 0 2 
0 0 0 1 0 0 0 1 1 
0 010 1000 x 
| | 0 100 00-10 y 
© | -1000 0100 2 
0 001 0001 1 
1 0100 1 
=1 | 0 0-10 1 
-1| |-1000 1 
1 0001 1 


therefore, (1, 1, 1) is transformed to (1, —1, —1). 


19.6.6 Rotation About an Arbitrary Axis 


Rotate the point (3, 0, 0), 180° about the axis defined by the vector n = і + j + К. 


Solution 
X5 а?К + cosa abK — сѕіпа acK +bsina 0 | [ х, 
у, | _ | abK +сѕіпо DK + соза bcK —asina0 | | yp 
e ~ | acK — Ьѕіпо bcK -- asina. c?K +cosa 0 Zp 
1 0 0 0 1 1 
where: 


К =1-cosa 


Given a = 180°, then К = 1 + 1 = 2, and d = —i-- zj + Łk. Therefore: 
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122. 72 
3339 
124420 
|=|2 33 0 
1 E 
3 3 3 
0 0 01 


and the point (3, 0, 0) is rotated to (—1, 2, 2). 


19.6.7 3D Rotation Transform Matrix 


Show that the matrix for rotating a point about an arbitrary axis corresponds to the 
three matrices for rotating about the x-, y- and z-axis. 


Solution 
a?K + cosa abK — csing acK + Ьѕіпа 0 


abK +csina b?K +cosa bcK —asina 0 
acK — bsina bcK +asina c?K +cosa 0 
0 0 0 1 


Pitch about the x-axis: й = i, where а = 1 апар = с = 0; К = 1 — соѕа: 


10 0 0 
0 cosa —sina 0 
0 sina cosa 0 
0 0 O0 1 


pitch — 


Yaw about the y-axis: ñ = j, where b = 1 anda = с = 0; К = 1 — cosa: 


cosa 0 sina 0 


B 0 1 0 0 
yaw = | _ sina 0 cosa 0 
0 0 0 1 


Roll about the z-axis: ñ = К, where с = 1 anda = b = 0; К =1- cosa: 


cosa — sina 00 
sina cosa 00 
0 0 10 
0 0 01 


roll = 
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19.6.8 Perspective Projection 


Compute the perspective coordinates of a 3D cube stored in Table 19.2 with the 
projection screen distance d = 20. Sketch the result. 


Solution 
Using the perspective transform: 


Xp -10 0 0 Xe 
yp} |0100 Ус 
| |0010 Ze 
w 0 01/40 1 


the perspective coordinates are stored in Table 19.2, and Fig. 19.17 shows а sketch 
of the result. 


Table 19.2 Coordinates of a 3D cube 


Vertex | xe | Ye Ze Xp | Ур 


0 0 10 0 0 
10 0 10 20 0 
10 10 10 20 20 

010 10 0 20 
20 0 0 
10 0 20 10 0 
10 0 20 10 10 

010 20 0 | 10 


бо 3 С\ tn RR ос н 
о 
о 


Fig. 19.17 А perspective 
sketch of a 3D cube 


Chapter 20 (R) 
Analytic Geometry ess 


20.1 Introduction 


This chapter explores some basic elements of geometry and analytic geometry. I have 
included a short review of important elements of Euclidean geometry with which 
you should be familiar. Perhaps the most important topics that you should try to 
understand concern the definitions of straight lines in space, 3D planes, and how 
points of intersection are computed. Another useful topic is the role of parameters 
in describing lines and line segments, and their intersection. 


20.2 Background 


Inthethird century BCE, Euclid laid the foundations of geometry that have been taught 
in schools for centuries. In the 19th century, mathematicians such as Bernhard Rie- 
mann and Nicolai Lobachevsky (1792-1856) transformed this Euclidean geometry 
with ideas such as curved space, and spaces with higher dimensions. Although none 
of these developments affect computer science, they do place Euclid's theorems in a 
specific context: a set of axioms that apply to flat surfaces. We have probably all been 
taught that parallel lines never meet, and that the internal angles of a triangle sum to 
180°, but these are only true in specific situations. As soon as the surface or space 
becomes curved, such rules break down. So let's review some rules and observations 
that apply to shapes drawn on a flat surface. 


20.2.1 Angles 


By definition, 360° is one revolution. 27 radians is also one revolution, and you 
should be familiar with both units of measurement, and how to convert from one 
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Fig. 20.1 Examples of 
adjacent, supplementary, 
opposite and complementary 
angles 


a+ 8 = 180° §+¢=90° 


to the other. Figure 20.1 shows examples of adjacent/supplementary angles (sum to 
180°), opposite angles (equal), and complementary angles (sum to 90°). 


20.2.2 Intercept Theorems 


The Intercept Theorems are attributed to the Greek philosopher and mathemati- 
cian Thales of Miletus (c.624—c.546 BC) and involve intersecting and parallel lines. 
Figures 20.2 and 20.3 show two scenarios that give rise to the following observations: 


e First intercept theorem: 


a+b с-а b d 


, 


a C a 


e Second intercept theorem: 


20.2.3 Golden Section 


The golden section is widely used in art and architecture to represent an ‘ideal’ ratio 
for the height and width of an object. Its origins stem from the interaction between 


Fig. 20.2 The first intercept 


theorem у 
€ 


d b 
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Fig. 20.3 The second 
intercept theorem 


a 


a 


Fig. 20.4 А rectangle with a 
height to width ratio equal to | 
the golden section 

6.18 


b 
d 
x 10.0 Y 


a circle and triangle and give rise to the following relationship: 


b= 5 (v5 — 1) = 0.618a 


The rectangle in Fig. 20.4 has proportions: 


height = 0.618 x width 


20.2.4 Triangles 


The rules associated with interior and exterior angles of a triangle are very useful in 
solving all sorts of geometric problems. Figure 20.5 shows two diagrams identifying 
interior and exterior angles. We can see that the sum of the interior angles is 180°, 
and that the exterior angles of a triangle are equal to the sum of the opposite angles: 


o 4- B --0 = 180° 
o =0 + В 
В = а +0 
6 =а+ В 
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20.2.5 Centre of Gravity of a Triangle 


A median is a straight line joining a vertex of a triangle to the mid-point of the 
opposite side. When all three medians are drawn, they intersect at a common point, 
which is also the triangle’s centre of gravity. The centre of gravity divides all the 
medians in the ratio 2 : 1. Figure 20.6 illustrates this arrangement. 


20.2.6 Isosceles Triangle 


Figure 20.7 shows an isosceles triangle which has two equal sides of length / and 
equal base angles o. The triangle's altitude (height) and area are: 


Fig. 20.5 Relationship 
between interior and exterior 
angles 


Fig. 20.6 The three medians 
of a triangle intersect at its 
centre of gravity 


Fig. 20.7 An isosceles 
triangle 
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20.2.7 Equilateral Triangle 


An equilateral triangle has three equal sides of length / and equal angles of 60°. 
The triangle’s altitude h and area А are: 


paa don 
2 4 


20.2.8 Right Triangle 


Figure 20.8 shows a right triangle with its obligatory right angle. The triangle's 
altitude and area are: 
ab 1 
h=—, А = -ab 
с 2 


20.2.9 Theorem of Thales 


Figure 20.9 illustrates the theorem of Thales which states that the right angle of a 
right triangle lies on the circumcircle over the hypotenuse. 


Fig. 20.8 A right triangle 


Fig. 20.9 The theorem of 
Thales 
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20.2.10 Theorem of Pythagoras 


Although the theorem of Pythagoras is named after Pythagoras, there is substantial 
evidence to show that it was known by the Babylonians a millennium earlier. How- 
ever, Pythagoras is credited with its proof. Figure 20.10 illustrates the well-known 
relationship: 

а? = 2 + с? 


from which опе can show that: 


sin? 6 + соѕ20 = 1 


20.2.11 Quadrilateral 


Quadrilaterals have four sides and include the square, rectangle, trapezoid, parallel- 
ogram and rhombus, whose interior angles sum to 360°. As the square and rectangle 
are familiar shapes, we will only consider the other three. 


20.2.12 Trapezoid 


Figure 20.11 shows a trapezoid which has one pair of parallel sides Л apart. The 
mid-line m and area are given by: 


1 
= — b 
m 24% ) 
A=mh 


Fig. 20.10 The theorem of 
Pythagoras states that a 


а? = 2 + с? 
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Fig. 20.11 А trapezoid with 
one pair of parallel sides 


Fig. 20.12 А parallelogram 
formed by two pairs of 
parallel lines 


20.2.13 Parallelogram 


Figure 20.12 shows a parallelogram which is formed from two pairs of intersecting 
parallel lines, so it has equal opposite sides and equal opposite angles. The altitude, 
diagonal lengths and area are given by: 


h=bsina 
КҮНІ CARP ae 
A=ah 


20.2.14 Rhombus 


Figure 20.13 shows a rhombus, which is a parallelogram with four sides of equal 
length a. The area is given by: 


— n2 _ 1 
А = а? sina = 244 
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Fig. 20.13 A rhombus is а 
parallelogram with four 
equal sides 


Fig. 20.14 Part of a regular 
gon 


20.2.15 Regular n-gon 


Figure 20.14 shows part of a regular n-gon with outer radius Ко, inner radius К; 
and edge length a,. Table 20.1 shows the relationship between the area, а,, К; and 


R, for different polygons. 


20.2.16 Circle 


The circumference C and area A of a circle are given by: 
C = nd =2лг 


А = лт? = -nd 
лғ 4” 


where the diameter d = 2r. 
An annulus is the area between two concentric circles as shown in Fig. 20.15, 


and its area A is given by: 
А=л (R -r°)= дя (D-4) 


where D = 2R and d = 2r. 
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Table 20.1 The area А,, edge length a; , inner radius R;, and outer radius Ко for different polygons 
п аһ = 2; tan(180°/n) Ri = Rocos(180°/n) RŽ = R? + 142 
п А, = 142 cot(180°/n) А, = $R2sin(360°/n) A, = nR? tan(180°/n) 


5 as=2RiV5S—-2V5 Е; = (4/5 +1) Ко = Ri( 5 — 1) 
2 
5 As = 2У25--10У5 As = 3R2V10+2V5 As = 5RI/5 – 24/5 


6 ag = 3:3 Ri = 5/3 Ro = $ Ri v3 
6 Ав = зау/3 Ав = 3 R23 Ав =28?У3 


8 ap —2R(2—1) ЕИ ГЕ Е 
8 Ав = 248 (2+1) Ag = 22,2 Ag = ВЕ? (V2 - 1) 


10 ajo = 2RiV 25 – 10/5 В, = &V/10--2/5 А, = 8,50- 10/5 
10 Ajo = žav 5+ 2V5 Ajo = 3R2V10 – 245 Ai = 28225 — 10/5 


Fig. 20.15 Ап annulus 
formed from two concentric 
circles 


А 
Ф” 


Fig. 20.16 А sector of a 
circle defined by the angle a 


Figure 20.16 shows a sector of a circle, whose area is given by: 


о 


3609 


лг? 
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Fig. 20.17 А segment of a 
circle defined by the angle о 


Figure 20.17 shows a segment of a circle, whose area is given by: 
1, 3 5 : 
А = 2 (a — sing), where a is in radians. 


The area of an ellipse with major and minor radii a and b is: 


А = zab 


20.3 2D Analytic Geometry 


In this section we briefly examine familiar descriptions of 2D analytic geometry 
and ways of computing intersections. 


20.3.1 Equation of a Straight Line 


The well-known equation of a straight line is: 
y=mx+c 


where m is the slope and c the intersection with the y-axis, as shown in Fig. 20.18. 
This is called the normal form. 

Given two points (хі, yi) and (xo, y2) we can state that for any other point 
(x, y): 


У 2—M 
х= ХІ X2 — X1 
which yields: 
y2 — У 
X2 ХІ 


у= (х= ху) + у 
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Fig. 20.18 The normal form Y, 
of the straight line 


Fig. 20.19 The orientation 
of a line can be controlled by 
a normal vector n and a 
distance d 


The main problem with the normal form is that it can not describe vertical lines, and 
requires the general form: 
ах +by+d=0 


As we shall see, this equation possesses some interesting qualities. 


20.3.2 The Hessian Normal Form 


The Hessian normal form is named after the German mathematician Ludwig Otto 
Hesse (1811-1874). Figure 20.19 shows this type of a line whose orientation is 
controlled by a normal unit vector n = [a ЫТ. If P(x, y) is any point on the line, 
then p is a position vector where р = [x y] and d is the perpendicular distance 
from the origin to the line therefore: 


= COS Q 


d 
Ірі 
апа 


d = ||p|| cosa 
But the dot product n - p is given by: 


n- p = ||n||||p]| cosa = ax + by 
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which implies that: 
ax + by = Ат] 


and because ||n|| = | we can write: 
ах +by—d=0 


where (x, y) is a point on the line, a and b are the components of a unit vector 
normal to the line, and d is the perpendicular distance from the origin to the line. The 
distance d is positive when the normal vector points away from the origin, otherwise 
it is negative. For example, let’s find the equation of a line whose normal vector is 
[3 АТ and the perpendicular distance from the origin to the line is 1. 

We begin by normalising the normal vector to its unit form. Therefore: if n = 
[3 АП, |n| = 43? 4- 42 = 5. 
The equation of the line is: 


Sup seg 
ЕЕ Aa d ex 
сас, 


Similarly, let’s find the Hessian normal form of у = 2x + 1. Rearranging the 
equation we get: 
2х-у--і 


which gives а negative distance. If we want the normal vector to point away from 
the origin we multiply by —1: 


—2x+y—-1=0 


Normalise the normal vector to a unit form: 


ie. V(-2? +12 =V5 


therefore: the perpendicular distance from the origin to the line, and the unit normal 
vector are respectively: 


Lo a | -2 1 | 
V5 М5 v5 
As the Hessian normal form involves a unit normal vector, we can incorporate the 
vector’s direction cosines within the equation: 
xcosa+ysina—d=0 


where a is the angle between the normal vector and the x-axis. 
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Fig. 20.20 The Hessian Y, 
normal form of the line 
equation partitions space into 
two zones 


axz+by—d>0 


ax +by—d<0 


20.3.3 Space Partitioning 


The Hessian normal form provides a very useful way of partitioning space into two 
zones: the partition that includes the normal vector, and the opposite partition. This 
is illustrated in Fig. 20.20. 
Given the equation: 
ах +by—d=0 


a point (x, y) on the line satisfies the equation. But if the point (x, y) is in the 
partition in the direction of the normal vector, it creates the inequality: 


ах +by—d>0 
Conversely, if (x, y) is in the partition opposite to the direction of the normal vector 
creates the inequality: 

ах +by—d<0O 
This space-partitioning feature of the Hessian normal form is useful in clipping lines 


against polygonal windows. 


20.3.4 The Hessian Normal Form from Two Points 


Given two points (хі, у) and (x2, уз) we compute the values of a, b and d for the 
Hessian normal form as follows. 
The vector joining the two points is v = |А, Ду] where: 


Ду = X2 — х 


Ау = у — у 


ПУ = (A? + 42 
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The unit vector normal to v is n = [- 4, A‘]", where: 


А 
ШИП 
йы 
1м 


therefore: let p = [х y]! be any point on the line, and using the Hessian Normal 
Form, we can write: 


n-p= -—— d +A y= =A + Д, у 


and 
— Aix ALy + (Аж Ду) =0 (20.1) 


For example, given the following points: (хі, уі) = (0, 1) and (x2, y2) = 
(1, 0); then Д, = 1/4/2 and А’, = —1/¥2. therefore: using (20.1): 


х у —1 Lx. 

Fatt ri ы 
ЖЕ ee er 
42. 42 4/2 


20.4 Intersection Points 


20.4.1 Intersecting Straight Lines 


Given two line equations of the form: 


ax 4 by di = 0 
ax + by -- do = 0 


the intersection point (x;, y;) is given by: 


bid» — bod, 

Xj = —————— 
a1 b» = ай 

р dia = Фа 


i= 


ay bz — abı 
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or using determinants: 


bi di 
by d, 
a, bi 
аз b 
di ај 
d» a2 
a1 bi 
a» bz 


If the denominator is zero, the equations are linearly dependent, indicating that there 
is no intersection. 


20.4.2 Intersecting Line Segments 


We are often concerned with line segments, as they represent the edges of shapes 
and objects. So let’s investigate how to compute the intersection of line segments. 
Figure 20.21 shows two line segments defined by their end points РІ, Po, Рз, P4 
and respective position vectors ру, p». рз and p4. We can write the following vector 
equations to identify the point of intersection: 


Pi = р +t — р!) (20.2) 
pi = рз + 5(p4 — рз) (20.3) 


where parameters s and t vary between 0 and 1. For the point of intersection, we can 
write: 


ри +/(р — p) = ps + 5(р4 — рз) 


Fig. 20.21 Two line 
segments with their 
associated position vectors 


x 
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therefore: the parameters s and f are given by: 


= t = 
_ (р: — рз) + t(P2 — рі) (20.4) 
ра — рз 
‚ — (93 — Pi) +s (P4 — рз) (20.5) 
p2 — Pi 


From (20.5) we can write: 


- (хз = х1) + s(x4 — x3) 


X2 — X1 
= (уз — yi) + 5004 — уз) 
У — У 


Let’s find 5: 


(x3 - хі) +504 — хз) _ O3 — y1) +504 — уз) 
X2—2X1 J2 == у] 
(y2 — ут) (хз — xi) + 5(х4 — хз)(уз — y1) = (х2 — х1)03- y) + 504 — уз)(Х2 — x1) 
8(х4 — хз)(у2 — y1) — s4 — уз) (хо — x1) = (x2 — ху)(уз — y1) — 02 — у1)(Хз — хі) 


which yields: 


(x2 — xi) (ys — у) — (Хз — 2x1)» у) (20.6) 
(x4 — хз) (yo — у) — (x2 — х1)(у4 — уз) 


Similarly to find t: 


ps (x1 = хз) + t(x2 — x1) 


(x4 — x3) 
f= Qi — y3) t f» — У) 
(ул — уз) 


(x1 — x3) +t(x2 = x1) _ (уз уз) + 2 — y1) 
(x4 — x3) Е (ул — уз) 
(x1 — хз) (4 — уз) + оо — x1) a — уз) = (x4 — хз) (у — уз) + t (x4 — x3)(y2 — у) 
1002 — xi) ra — Уз) — t (x4 — х3) (02 — y1) = (x4 — х3) (у — уз) — (xı — хз) (4 — уз) 


which yields: 


p% хз)(у — уз) — (xı — x3)(y4 — уз) (20.7) 
(xo — xi) 4 — уз) — (x4 — x3)(y2 — у) 


Let’s test (20.6) and (20.7) with two examples to illustrate how the equations are 
used in practice. The first example demonstrates an intersection condition, and the 
second demonstrates a touching condition. 
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Figure 20.22a shows two line segments intersecting, with an obvious intersection 
point of (1.5, 0). The coordinates of the line segments are: 


(хі, у) = (1, 0), (х, y2) = (2, 0) 
(хз, уз) = (1.5, D, (ху, y4) = (1.5, 1) 


therefore: 
pe x1) (уз = у) — (хз — x1) (V2 — y) 
(x4 — X3)(Y2 — у) — (x2 — x1) (y4 — уз) 
006-0) — 0.5)00) 1 4, 
(0) (0) — (1) (2) —2 | 
апа 


_ ба ж) 07 уз) — Өл — х3) 04 — уз) 
(x2 — xi) — Уз) — Ga — хз) Ор — yi) 
_ OM) = (-0.5)0) _1_ 
(1)(2) — (0)(0) 2 
Substituting s and г in (20.2) and (20.3) we get (x;, yj) = (1.5, 0) as predicted. 


Figure 20.22b shows two line segments touching at (1.5, 0). The coordinates of 
the line segments are: 


(хі, у) = (1, 9), (хә, y2) = (2, 0) 
(хз, Уз) = (1.5, 0), (ха, y4) = (1.5, 1) 


therefore: 
г- 92 ху)(уз — y1) — (хз — x (2 — y) 
(x4 — х3) (0 — y1) — (x2 —o1)04 — уз) 
_ (1)(0) — (0.5)(0) M 0 _ 
(0)(0) — (1)(1) —1 
апа 


_ (4 — хз) Ол — уз) — xı — 43) 04 — уз) 
— (ху = x1) a — уз) — (ха — 3) (2 — у) 
_ (0)(0) — (—0.5)(1) _ 0.5 — 05 

(1)(1) — (0) (0) 1 


The zero value of s confirms that the lines touch, rather than intersect, and t = 0.5 
confirms that the touching takes place halfway along the line segment. 


410 20 Analytic Geometry 


Fig. 20.22 a Shows two line Ү, Y, 
segments intersecting 

b Shows two line segments 
touching 


(a) (b) 


20.5 Point Inside a Triangle 


We often require to test whether a point is inside, outside or touching a triangle. 
Let's examine two ways of performing this operation. The first is related to finding 
the area of a triangle. 


20.5.1 Area of a Triangle 


Let's declare a triangle formed by the anticlockwise points Pı (x1, yi), P2(x2, у) 
and P3(x3, уз) as shown in Fig. 20.23. The area of the triangle is given by: 


1 1 1 
А = (x2 — xi)3 — y1) 2 (2 xi)» = y1) 2 02 хз) (уз — y2) 2 03 xis — у) 


which simplifies to: 


1 
A= 51102 - уз) + x2(¥3 — у) + xai — »2)] 


and can be represented as a determinant: 


Fig. 20.23 The area of the У 
triangle is computed by 
subtracting the smaller 
triangles from the 
rectangular area 


20.5 Point Inside a Triangle ап 


Fig. 20.24 Тһе top triangle У 
has anticlockwise vertices, 
and the bottom triangle 
clockwise vertices 


1 ХІ У 
А = 2 X2 y 
X3 уз 1 


Figure 20.24 shows two triangles with opposing vertex sequences. If we calculate 
the area of the top triangle with anticlockwise vertices, we obtain: 


1 
A= 52-4 +34 2) +20 - 2)] =2 
whereas the area of the bottom triangle with clockwise vertices is: 
1 
А= 510-0) +30- 2) +20 - 2)] = –2 


which shows that the technique is sensitive to vertex direction. We can exploit this 
sensitivity to test if a point is inside or outside a triangle. 

Consider the scenario shown in Fig. 20.25, where the point Р, is inside the triangle 
(P 1, P. 2» Р 3). 


e Iftheareaoftriangle (Рі, P», Р,) 1$ positive, P, mustbetotheleftofthe (Ру, P2). 
e Iftheareaoftriangle (P2, Рз, P;)is positive, P, must be to the left of the (P2, P3). 
e Ifthe area of triangle (Рз, Pj, Р;) is positive, P, must be to the left of the (Рз, Pi). 


If all the above tests are positive, Р, is inside the triangle. Furthermore, if one area 
is zero and the other areas are positive, the point is on the boundary, and if two areas 
are zero and the other positive, the point is on a vertex. 

Let’s now investigate how the Hessian normal form provides a similar function. 
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Fig. 20.25 If the point P, is 
inside the triangle, it is 
always to the left as the 
boundary is traversed in an 
anticlockwise direction 


Fig. 20.26 The triangle is 
represented by three line 
equations expressed in the 
Hessian normal form. Any 
point inside the triangle is 
found by evaluating their 
equations 


20.5.2 Hessian Normal Form 


We can determine whether a point is inside, touching or outside a triangle by rep- 
resenting the triangle’s edges in the Hessian normal form, and testing in which 
partition the point is located. If we arrange that the normal vectors are pointing 
towards the inside of the triangle, any point inside the triangle will create a positive 
result when tested against the edge equation. In the following calculations there is no 
need to ensure that the normal vector is a unit vector, therefore (20.1) can be written: 


—Аух + Axy + (Aya — Дуу) = 0 


To illustrate this, consider the scenario shown in Fig. 20.26 where a triangle is 
formed by the points (1, 1), (3, 1) and (2, 3). With reference to (20.1) we compute 
the three line equations: 

The line between (1, 1) and (3, 1): 


A, =2 

A, =0 
-0хх+2ху-2х1=0 
25-2 = 0 
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The line between (3, 1) and (2, 3): 


Аұ--і 
Ay =2 
—2xx-Ixy+@x3+1x1)=0 
—2x-—y+7=0 
The line between (2, 3) and (1, 1): 
Аұ--і 
Ау--2 
2xx-1xy+(-2x2-1x3)=0 
2х-у-і-0 
Thus the three line equations for the triangle are: 
2y—2=0 
—2x-y+7=0 
2x—y—120 
We are only interested in the signs of the equations: 
2y -2 (20.8) 
-2х-у--7 (20.9) 
2x —y—1 (20.10) 


which can be tested for any arbitrary point (x, y). If they are all positive, the point is 
inside the triangle. If one expression is negative, the point is outside. If one expression 
is Zero, the point is on an edge, and if two expressions are zero, the point is on a 
vertex. 

Just as a quick test, consider the point (2, 2). The three expressions (20.8) to 
(20.10) are positive, which confirms that the point is inside the triangle. The point 
(3, 3) is obviously outside the triangle, which is confirmed by two positive results 
and one negative. Finally, the point (2, 3), which is a vertex, creates one positive 
result and two zero results. 


20.6 Intersection of a Circle with a Straight Line 


The equation of a circle has already been given in an earlier chapter, so we will now 
consider how to compute the intersection of a circle with a straight line. We begin 
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Fig. 20.27 The intersection Y, 


of a circle with a line у-2-1 


by testing the equation of a circle with the normal form of the line equation: 


х + у? = р? and у= тх +с 


By substituting the line equation in the circle’s equation we discover the two inter- 
section points: 


—mc+/r2(1 + т?) — с? 


= 20.11 

X12 am ( ) 
с +myr?(1 + m?) — c? 

= 20.12 

yı,2 imi ( ) 


Let's test this result with the scenario shown in Fig. 20.27. Using the normal form 
of the line equation we have: 


у=х+1 m=1, and c=1 
Substituting these values in (20.11) and (20.12) yields: 
X127 —1, 0, у2=0, 1 
The actual points of intersection are (—1, 0) and (0, 1). 
Testing the equation of the circle with the general equation of the line ax + by 


+c = 0 yields intersections given by: 


—ас + byr? (a? + b?) — с? 


дат эр (20.13) 
—bc + ayr? (a? + b?) — с? 
У2 = (20.14) 


a? + p? 
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The general form of the line equation y = x + 1 is: 
х-у+1= 0 where a—l,b—-—1 and c=1 
Substituting these values in (20.13) and (20.14) yields: 
Xi2— —1, 0, and у2=0, 1 
which gives the same intersection points found above. 


Finally, using the Hessian normal form of the line ax + by — d = 0 yields inter- 
sections given by: 


хо = ad E byr? – а? (20.15) 
уо = bd + ayr? – а? (20.16) 


The Hessian normal form of the line equation x — y + 1 = 015: 


—0.707x + 0.707у — 0.707 = 0 


where а © —0.707, b © 0.707 апа d ~ 0.707. Substituting these values in (20.15) 
and (20.16) yields: 
Х1,2 = —1, 0 and y12 = 0, 1 


which gives the same intersection points found above. One can readily see the compu- 
tational benefits of using the Hessian normal form over the other forms of equations. 


20.7 3D Geometry 


3D straight lines are best described using vector notation, and readers are urged 
to develop strong skills in these techniques if they wish to solve problems in 3D 
geometry. Let's begin this short survey of 3D analytic geometry by describing the 
equation of a straight line. 


20.7.1 Equation of a Straight Line 


We start by using a vector b to define the orientation of the line, and a point a in space 
through which the line passes. This scenario is shown in Fig. 20.28. Given another 
point P on the line we can define a vector tb between a and Р, where t is a scalar. 
The position vector p for P is given by: 
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Fig. 20.28 The line 
equation is based upon the 
point a and the vector b 


p=a+tb 
from which we can obtain the coordinates of the point P: 


Xp = Xa + Ех 
Ур = Ya + typ 
Zp = Za d tZb 


For example, if b= [1 2 3]! and a = (2, 3, 4), then by setting t = 1 we can 
identify a second point on the line: 


Xp=2+1=3 
Yp =342=5 
1р-4-3-7 


In fact, by using different values of t we can slide up and down the line with ease. 
If we have two points P; and P», such as the vertices of an edge, we can represent 
the line equation using the above vector technique: 


P = р +/(р – р) 


where р; and р» are position vectors to their respective points. Once more, we сап 
write the coordinates of any point P as follows: 


Xp =X, + Е(х2 — x1) 
Ур = yı +t2— y1) 
Zp = 01 + Ё(22 — Z1) 
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20.7.2 Intersecting Two Straight Lines 


Given two straight lines we can test for a point of intersection, but must be prepared 
for three results: 


e A real intersection point. 
e No intersection point. 
e An infinite number of intersections (identical lines). 


If the line equations are of the form: 


p=a,+ rb, 
p=a+rb 


for an intersection we can write: 


ai + rb; = ao + 50 


which yields: 
Xal + ГХы = Хах + SXp2 (20.17) 
Yat “ҒУЫ = Уа + SYp2 (20.18) 
Zal + rZbi = Za2 + 5552 (20.19) 


We now have three equations in two unknowns, and any value of r and s must hold for 
all three equations. We begin by selecting two equations that are linearly independent 
(i.e. one equation is not a scalar multiple of the other) and solve for ғ and s, which 
must then satisfy the third equation. If this final substitution fails, then there is no 
intersection. If all three equations are linearly dependent, they describe two parallel 
lines, which can never intersect. 

To check for linear dependency we rearrange (20.17) to (20.19) as follows: 


ГХЬ — $Xp2 = Хал — Xal (20.20) 
rybi = SYp2 = Ya2 — Уа (20.21) 
К2ы — 5552 = 542 — Zal (20.22) 


If the determinant A of any pair of these equations is zero, then they are dependent. 
For example, (20.20) and (20.21) form the determinant: 


Хы —Хь2 
Уы — У 


which, if zero, implies that the two equations can not yield a solution. As it is impos- 
sible to predict which pair of equations from (20.20) to (20.22) will be independent, 
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let’s express two independent equations as follows: 


га = 5а2 = by 


raz, — 5а22 = bz 


which yields: 
anbı — аЬ» 
A 
_ anbi = anb 
A 
where: 
A= аш (412 
ау 922 


Solving for ғ апа 5 we obtain: 


РЕ уы (Ха? — Ха) — Xp2 (Уа? — Var) 
Хы Уь2 — Уь1Хь2 

dies Уы (Ха — Ха) — Хь (Уа — Yai) 
Xb1Yb2 — YbiXp2 


(20.23) 


(20.24) 


As a quick test, consider the intersection of the lines encoded by the following 
vectors: 


0 3 0 2 
а-|1|, М-|3|, а= [0.5 |, b=] 3 
0 3 0 2 


Substituting the х- and y-components in (20.23) and (20.24) we discover: 


but for these to be consistent, they must satisfy the z-component of the original 
equation: 
ГЫ — 52,2 = Za2 — Zal 


1 1 
~x3-~x2=0 
3 2 


which is correct. Therefore, the point of intersection is given by either: 


p; =a;+rb;, or 
р; = a» + sho 
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Let’s try both, just to prove the point: 


1 1 
j—0 -3- 1, і-- 0 -2--1 
X +3 x +5 
в ge о 
DIS e МЕЕ pe 


1 1 
ЖЕ oe шш 22-1 
2 Boris 2 0+; 


therefore: the point of intersection point is (1, 2, 1). 
Now let’s take two lines that don’t intersect, and also exhibit some linear depen- 


2 0 2 
а = 1 Е b; = 2 ‚ а= 2 P b = 2 
0 0 1 


Taking the x- and y-components we discover that the determinant A is zero, which 
has identified the linear dependency. Taking the y- and z-components the determinant 
is non-zero, which permits us to compute ғ and s using: 


ym 552 Ya2 e: Yat) PE Уь (Za2 ws Zal) 
Yb1Zb2 — Zb1 Уь2 
ge 2ы Уа? = Уа) E. Уы Zaz p Zal) 


Yb1Zb2 — Zb1 Yb2 
|.12-1)-2(00-0) 1 


r= 


2х1-0х2 2 
0(2 — 1) — 2(0 — 0) 
= =0 
2x1-0x2 


But these values of r and s must also apply to the x-components: 
rXbi — SXp2 = Xa2 — Xal 
1 
2 x2—0x2720 


which they clearly do not, therefore the lines do not intersect. 
Now let's proceed with the equation of a plane, and then look at how to compute 
the intersection of a line with a plane using a similar technique. 


20.8 Equation of the Plane 


We now consider four ways of representing a plane equation: the Cartesian form, 
general form, parametric form and a plane from three points. 
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Fig. 20.29 The vector n is 
normal to the plane, which 
contains a point Ро. P is any 
other point on the plane 


20.8.1 Cartesian Form of the Plane Equation 


One popular method of representing a plane equation is the Cartesian form, which 
employs a vector normal to the plane’s surface and a point on the plane. The equation 
is derived as follows. 

Let п be a nonzero vector normal to the plane and Ро(хо, yo, zo) a point on 
the plane. P(x, y, z) is any other point on the plane. Figure 20.29 illustrates the 
scenario. 

The normal vector is defined as: 


n= аі + bj + ck 
апа the position vectors for Ру апа Р аге: 


ро = xoi + yoj + zok 
p = xi + у] + zk 


respectively. From Fig. 20.29 we observe that: 


q =P- ро 
and as n is orthogonal to q: 
n-q=0 
therefore: 
п. (p— po) =0 


which expands into: 
n-p=N-Ppo (20.25) 
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Writing (20.25) in its Cartesian form we obtain: 
ах + by + cz = axo + byo + cz 

but axo + byo + czo is a scalar quantity associated with the plane and can be replaced 
by d. Therefore: 

ax+by+cz=d (20.26) 
which is the Cartesian form of the plane equation. 

The value of d has the following geometric interpretation. 

In Fig. 20.29 the perpendicular distance from the origin to the plane is: 


h = рої созо 


therefore: 
n · ро = |про! cose = й п] 


therefore: the plane equation is also expressed as: 
ах + by + cz = h||n|| (20.27) 
Dividing (20.27) by ||n|| we obtain: 


a 4 b " с 
x y LS 
Іһ| п | Іһ| 


where: 


| = Va? + b? + с? 


This means that when a unit normal vector is used, h is the perpendicular distance 
from the origin to the plane. Let's investigate this equation with an example. 

Figure 20.30 shows a plane represented by the normal vector n = j +k and a 
point on the plane Ро(0, 1, 0). Using (20.26) we have: 


Ox+ly+iz=0x04+1x14+1x0=1 


therefore: the plane equation is: 
У+==1 


If we normalise the equation to create a unit normal vector, we have: 


y 2 1 


JA VG XS 


where the perpendicular distance from the origin to the plane is 1/4/2. 
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Fig. 20.30 А plane yt 
represented by the normal 

vector n and a point A 

Po(0, 1, 0) A В (0,1,0) 


20.8.2 General Form of the Plane Equation 


The general form of the plane equation is expressed as: 
Ax+ By+Cz+D=0 
which means that the Cartesian form is translated into the general form by making: 


А-а, B=b, С=с, Р--а 


20.8.3 Parametric Form of the Plane Equation 


Another method of representing a plane is the parametric form which employs two 
vectors and a point that lie on the plane. Figure 20.31 illustrates a scenario where 
vectors a and b, and the point T (x+, y;, z+) Пе on a plane. We now identify any other 
point on the plane P(x,, yp, Zp) with its associated position vector p. The point T 
also has its associated position vector t. 


Fig. 20.31 A plane is Y 
defined by the vectors a and 
b and the point T (xr, Yt, Zr) 
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Fig. 20.32 The plane is я 
defined by the vectors a and 
b and the point T(1, 1, 1) d 


Ж 
2 
x 
Using vector addition we can write: 
с = Ла + eb 
where A and є are two scalars such that c locates the point P. We can now write: 
р=+с (20.28) 


Therefore: 


Хр = X; + Аха + ех 
Ур = yr Ауа + €yb 
Zp = + Аа + ЕЗЬ 


which means that the coordinates of any point оп the plane are formed from the 
coordinates of the known point on the plane, and a linear mixture of the components 
of the two vectors. Let’s illustrate this vector approach with an example. 

Figure 20.32 shows a plane containing the vectors a = i and b = К, and the point 
T(1, 1, 1) with its position vector t = i+ j + К. By inspection, the plane is parallel 
with the xz-plane and intersects the y-axis at y = 1. 

From (20.28) we can write: 


p=t+aAa+eb 


where А. апа є are arbitrary scalars. 
For example, if à = 2 and є = 1: 


xp =14+2x14+1x0=3 
yp =1+2x0+4+1x0=1 
z;2142x041x122 


therefore: the point (3, 1, 2) is on the plane. 
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20.8.4 Converting from the Parametric to the General Form 


It is possible to convert from the parametric form to the general form of the plane 
equation using the following formulae: 


_ (a-b)(b- t) – (a-OllbI" 
_ РР — (a - b? 
_ (a-b)(a-t)— b: 01а? 
_  аРЫР — (a - b)? 


The resulting point P(x,, Ур, Zp) is perpendicular to the origin. 
If vectors a and b are unit vectors, A and є become: 


Te (a-b)(b-t)—a-t 


rcu pu (20.29) 


P's position vector p is also the plane's normal vector. Therefore: 


Xp = X; + Аха + €xp 
Ур = yr + Aya + еу 
Zp = Zt + Аа + €zp 


The normal vector is: 
p = хі + ypj + 2 


and because ||p|| is the perpendicular distance from the plane to the origin we сап 


state: 
Хр Ур <p 


x + yr 
Ip — lp Ip 


or in the general form of the plane equation: 


z = |р| 


Ax+ By+Cz+D=0 


where: x y Е 
Ірі Ірі Ірі 


Ав ап example, Fig. 20.33 shows a plane inclined 45° to the y- and z-axis and 
parallel with the x-axis. The vectors for the parametric equation are: 
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Fig. 20.33 The vectors а 
and b are parallel to the 
plane and the point (0, 0, 1) 
is on the plane 


а-і-Е 
b=i 
t=k 


Substituting these components in (20.29) and (20.30) we have: 


_ (0)) = (=) хт. 


m 2x1- (0) = 99 
.,0(-D-0 x2 _ 
ЕРГЕН ТЕН 


therefore: 


x, =0+05x0+0x1=0 
yp =0+0.5х1+0х0= 0.5 
zp = 140.5 x (CI) -0x 0— 0.5 


The point (0, 0.5, 0.5) has position vector p, where: 


2 
рі = /02 + 0.52 + 0.52 = ue 


the plane equation is: 


Ox + + 
Jin 


which simplifies to: 
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Fig. 20.34 The vectors used 
to determine a plane 
equation from three points 
К, Sand T 


20.8.5 Plane Equation from Three Points 


Very often we need to find the plane equation from three points. To begin with, the 
three points must be distinct and not lie on a line. Figure 20.34 shows three points R, 


. => — 
S and T, from which we create two vectors u = RS and v = RT. The vector product 
u x v then provides a vector normal to the plane containing the original points. We 


now take another point P(x, y, z) and form a vector w — RP. The scalar product 
у. (и x v) = Oif P is in the plane containing the original points. This condition 
can be expressed as a determinant and converted into the general equation of a plane. 
The three points are assumed to be in an anticlockwise sequence viewed from the 
direction of the surface normal. 


We begin with: 
i j К 
UXV-—|Xy Yu Zu 
Xv Ww £v 


As w is perpendicular to u x v: 


у. (иху) = Хи Уи Zu =0 


Expanding the determinant we obtain: 


Zu Xu 
Zw Xv 


Yu Zu 


Xw 
Yv £v 


+ Уш 


20.9 Intersecting Planes 
which becomes: 


Ys — YR 


X—X 
( к) YT — YR 


ZS — ZR ZS —ZR Xs — XR 
+ (у — yr) 

ZT — ZR ZT — ZR XT — XR 

Xs XR YS YR 

+ < — Zr) T 

XT = Хк YT — Ук 


This can be arranged in the form ax + by + cz + d = О where: 


=0 
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a= У8-Ув 45 — ZR p= ZS— ZR Х5- XR 
Ут — YR £T — ZR | XT —ZR XT — XR 
Xs Хв YST Ук 
c= , а= –(ахв + byr + cz 
кеде ete (axr + Бук к) 
ог 
l ук ZR XR l ZR XR YR 
a=|1 ys 25|, b= |xs 1 zs|, c=ļ|xs ys 1 
І yr zr xp l zr хг Yr 


d = —(ахв + Був + сів) 


As an example, consider the three points К (0, 0, 1), S(1, 0, 0), Т(0, 1, 0). 
therefore: 


1 0 4 0 1 0 0 1 
а-|1 0 021, b=|1 ol=1, с=|1 0 1|=1 
110 0 1 0 0 1 1 

d=-(1x041x041x1)=-1 


and the plane equation is: 
x+y+z—-1=0 


20.9 Intersecting Planes 


When two non-parallel planes intersect, they form a straight line at the intersection, 
which is parallel to both planes. This line can be represented as a vector, whose 
direction is revealed by the vector product of the planes’ surface normals. However, 
we require a point on this line to establish a unique vector equation; a useful point is 
chosen as Ро, whose position vector po is perpendicular to the line. 

Figure 20.35 shows two planes with normal vectors n, and пә intersecting to 
create a line represented by n5, whilst Po(xo, yo, Zo) is a particular point on п» and 
P(x, y, Z) is any point on the line. 
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Fig. 20.35 Two intersecting Y, 
planes create a line of mu 12 
intersection Ё. і 


We start the analysis by defining the surface normals: 


n; = ай + bij + сік 
Ny = mi + ој + ok 


next we define р and ро: 


p=xi+ yj + 2К 
ро = xoi + yoj + zok 


Now we state the plane equations in vector form: 


n;-p+d,=0 
п2.р+ 4 = 0 


The geometric significance of the scalars d; and d» has already been described above. 
Let’s now define the line of intersection as: 


р = po + Ап; 


where А. is a scalar. 
As the line of intersection must be orthogonal to n; and n»: 


пз = азі + 63] + сзК = n; х No 


Now we introduce Ру as this must satisfy both plane equations, therefore: 


n; : po = —di (20.31) 
п · po = —4 (20.32) 


and as Ру is such that po is orthogonal to na: 


ns - po = 0 (20.33) 
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Equations (20.31)-(20.33) form three simultaneous equations, which reveal the 
point: Ро These are represented in matrix form as: 


—di а by c Xo 
-Ф|=| о № о Уо 
0 аз Ёз c3 20 
ог 
а a b с Xo 
d |=-| о b c Yo 
0 аз Ёз сз 20 
therefore: 
Xo _ Уо _ 20 c. ==] 
di bi СІ aj а СІ aj bi di DET 
d» b» C2 a2 d» C2 a» b» d» 


0 b3 C3 a3 0 C3 a3 Рз 0 


which enables us to state: 


bi c b о 
а -а 
2 Рз C3 : Рз C3 
хо = 
DET 
dy аз C3 _ а аз сз 
а] сі аз © 
m- DET 
ај bi a2 bz 
d аз b| Е. аз b3 
= РЕТ 
where: 
aj bi СІ 
DET= a» b2 С? 
аз Рз C3 
The line of intersection is then given by: 
р = ро + Арз 


If DET = 0 the line and plane are parallel. 
To illustrate this, let the two intersecting planes be the xy-plane and the yz-plane, 
which means that the line of intersection will be the y-axis, as shown in Fig. 20.36. 
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Fig. 20.36 Two intersecting 
planes creating a line of 
intersection coincident with 
the y-axis 


nı k 
n, =i 
and dı = Ф = 0. 
We now compute n3, DET, xo, yo, zo: 
i j k 
і-|0 0 | =j 
1 0 0 
0 0 1 
DET =|1 0 0|-і1 
0 1 0 
1 0 0 
B Ug. 0 
хо = 1 =0 
0 0 0 
0 01 =0 10 
Уо = 1 = 0 
0 10 
Оз x| 9e л 
Zo = 1 = 0 


therefore: the line equation is р = Ans, where пз = j, which is the y-axis. 


20.9.1 Intersection of Three Planes 


Three intersecting planes will intersect at a point as shown in Fig. 20.37, and we 
can find this point by using a similar strategy to the one used in two intersecting 
planes by creating three simultaneous plane equations using determinants. 
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Fig. 20.37 Three mutually 
intersecting planes 


Figure 20.37 shows the common point P(x, у, z). The three planes can be defined 
by the following equations: 


ayx +biy+cz+d =0 
ax + biy - coz + Ф =0 
азх + у + сзз + dz = 0 


which means that they can be rewritten as: 


-( aj bi СІ X 
-Ф|-|а b» о y 
-Ф аз Ёз сз 2 
ог 
а а b с х 
d | =-| о № о y 
ау аз Рз C3 2 
ог in determinant form: 
х _ у _ 2 ES. 
di bı СІ я aj di СІ п ay b, di — DET 
d» bo C3 a2 d» C2 a» bz d» 
di 8 сз аз 4; сз аз ӛз а 
where: 
aj b, СІ 
DET = |а № о 
аз Рз C3 
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therefore: we can state that: 


d bi с 
dy b о 
dy b3 сз 
Jesu Жы у! 
РЕТ 
a d с 
a2 d» с? 
аз ds сз 
T€ ВЕР 
a by di 
a, b d, 
аз з а, 
= РЕТ 


If DET = 0, two of the planes at least, are parallel. Let’s test these equations 
with a simple example. 
The planes shown in Fig. 20.38 have the following equations: 


х+у+2-2 = 0 


z=0 
у—1=0 
therefore: 
1 1 1 
DET=|0 0 1'—-1 
0 1 0 


Fig. 20.38 Three planes 
intersecting at a point P 


20.9 Intersecting Planes 


—2 1 

оо 

ы 4 
Е 

ZI 

1 —2 

оо 

б <1 

МЕЕ 

i 

0 0 

0 1 

mem cl 


O = 


O = 


which means that the intersection point is (1, 1, 0), which is correct. 


20.9.2 Angle Between Two Planes 
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Calculating the angle between two planes is relatively easy and can be found by 
taking the dot product of the planes’? normals. Figure 20.39 shows two planes with 
о representing the angle between the two surface normals n, and п). 


Let the plane equations be: 


аж + by, + са +d; =0 
ax + Бу + ст + Ф = 0 


therefore: their surface normals are: 


n; = aji + bij + cık 
m = mai + bj + ck 


Fig. 20.39 The angle 
between two planes is the 
angle between their surface 
normals 


ny 


n2 
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Fig. 20.40 o is the angle 
between two planes 


Taking the dot product of n; and no: 


n; 02 = ||n; || |[nz|| cosa 


= ( 01: 02 ) 
a = COS ---- 
ШЕШ 


For example, Fig. 20.40 shows two planes with normal vectors n; and no. 
The plane equations are: 


and 


x+y+z-1=0 


z=0 
therefore: 
n; =i+j+k 
n, = k 
therefore: 
іші = 3 
[no]| = 1 
and 


1 
—1 © 
a = COS — | А 54.74 
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Fig. 20.41 o is the angle и 
between the plane’s surface 
normal and the line’s 
direction vector 


20.9.3 Angle Between a Line and a Plane 


The angle between a line and a plane is calculated using a similar technique used 
for calculating the angle between two planes. If the line equation employs a direction 
vector, the angle is determined by taking the dot product of this vector and between 
the plane’s normal. Figure 20.41 shows such a scenario where n is the plane’s surface 
normal and у is the line’s direction vector. 

Let the plane equation be: 


ax+by+cz+d=0 


then its surface normal is: 
n = ai + bj + ck 


Let the line's direction vector be v and T(x; y;, z;) is a point on the line, then any 
point on the line is given by the position vector p: 


p=t+Av 


therefore: we can write: 


= 
< 
Ш 


In|] ІУІ сова 


“| n-v ) 
a = cos ---- 
|| ПУ! 


When the line is parallel to the plane n - у = 0. 
Consider the scenario illustrated in Fig. 20.42 where the plane equation is: 


x+y+z—-1=0 


therefore: the surface normal is given by n: 
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Fig. 20.42 Тһе required MA 
angle is between n and a 1 | 
a 
| 
1 1 
2 X 
n=i+j+k 


and the line’s direction vector is a: 


a=i+j 
therefore: 

| = v3 

lal = v2 
and 


20.9.4 Intersection of a Line with a Plane 


A line and a plane may intersect, or not, if they are parallel. Either way, both 
conditions can be found using some simple vector analysis, as shown in Fig. 20.43. 
The objective is to identify a point P that is on the line and the plane. 

Let the plane equation be: 


ax+by+cz+d=0 


where: 
n = ai + bj + ck 
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Fig. 20.43 Тһе vectors и 
required to determine 

whether a line and plane 

intersect T 


P is a point on the plane with position vector: 


p = xi + yj + {К 


therefore: 
n-p+d=0 
Let the line equation be: 
p=t+Av 
where, 
= хі + yj + zk 
and 


y = xy + y,j+ zk 
therefore: the line and plane will intersect for some A such that: 
n-(t+Av)+d=n-t+An-v+d=0 


therefore: 


= ылы. 
n-v 


for the intersection point. The position vector for P is р = t + Av. 


If n - v = 0 the line and plane are parallel. 
Let's test this result with the scenario shown in Fig. 20.44. 
Given the plane: 


x+ty+z—-1=0 
n=i+j+k 
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Fig. 20.44 P identifies the 
intersection point of the line 
and the plane 


and the line: 


p=t+Av 
where: 
t=0 
v=i+j 
then: 


-(1х0--1х0--1х0-І1 
ы Se ы 
1x1l+1x1+1x0 


and the point of intersection is P(0.5, 0.5, 0). 


20.10 Summary 


Mixing vectors with geometry is a powerful analytical tool, and helps us solve many 
problems, encountered in computer science. Unfortunately, there has not been space 
to investigate every topic, but hopefully, what has been covered, will enable the reader 
solve other problems with greater confidence. 


Chapter 21 A) 
Eigenvectors and Eigenvalues cheret 


21.1 Introduction 


This chapter introduces the idea of an eigenvector, eigenvalue and the characteristic 
equation. The chapter concludes with a worked example. 


21.2 A Linear Transform 


A linear transform has two qualities: it preserves the position of the origin, and all 
grid lines remain equidistant apart and parallel. We discover that linear transforms 
can treat vectors differently, which reveals the existence of eigenvectors, with a 
corresponding eigenvalue. The German word eigen means characteristic, own, 
latent or special. 

To illustrate an eigenvector and eigenvalue, consider Fig. 21.1. It shows two points 
(—1, 1) and (4, 1), and two other points, (1, —1) and (—4, —1), diametrically 


opposite the first two. The two vectors | 1 


| апа Б аге the eigenvectors of the 


associated square matrix: with eigenvalues 1 and 6. 


4 

12 

The reason for identifying the points or vectors is as follows. For example, a point 
(x, y) is the result of applying the square matrix: 


ЕЗІНІН 
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Fig. 21.1 Four special points belonging to a linear transform 


But the eigenvectors are special, as they can either be transformed by the square 


1 | can either be 


matrix, or scaled by the eigenvalue. For example, the eigenvector | 


transformed to H or scaled by the eigenvalue: 


БЕІН 


^ 9-4 
€ ЕНІН 


We begin the search for eigenvectors, by first identifying the eigenvalues that 
create them. The square matrix is chosen carefully in these explanations, to create 
simple eigenvalues, such as whole numbers. But they can easily be any real or 
complex number, and the associated algebra, is complex. 

We can define an eigenvector and its eigenvalue as follows. Given a linear trans- 
form іп a square matrix A, a non-zero vector v is an eigenvector, and л is the corre- 
sponding eigenvalue if: 


21.2 A Linear Transform 


Av = Av 
Equation 21.1 can be written as: 
Ау —Aav=0 
and in matrix form: 
(A —ADv— 0 
As v is a non-zero vector: 
A-AI=0 


Equation (21.2) is called the characteristic equation of a square matrix. 
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(21.1) 


(21.2) 


Consider the 2D transform T that maps the point (x, y) to (ax + by, сх + dy): 


T (x, y) -> (ах + by, сх ғау) 


This is expressed in matrix form as: 
T: vr Ау 


or 


СЕЙ 


where: 


Ау = Av 
ab||x 
СЕЕ 
or 
ax + by = Ax 
сх + dy = Xy 


Rearranging, we have: 


| 


(a—A)x + by =0 
сх + (d—-A)y 20 
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а, 


For а non-zero [x у]? to exist, we must have: 


or back in matrix form: 


а-л b 
с а-х 


^ 


which is the characteristic equation. 
Let's use this on the transform we used in Fig. 21.1: 


ІНІНЕН 


ГТ 4 


Тһеп: 


1 Фе 
(5—2)2—2)—420 
10—74+47-4=0 
А2 —7A+6=0 

(A —6)(A— 1) =0 


- 


Therefore, А = 6 and A = 1, are the two eigenvalues. 
Next, we substitute the two values of A in: 


5— 4 x| |0 
1 2—А||у| |0 
to extract the eigenvectors. Let's start with А = 6: 
—1 4 х 0 
EH _ 
Equation (21.3) represents algebraically: 


-х--4у-0 
х= 4у = 0 


21.2 A Linear Transform 443 


which is simplified by adding the bottom row to the top row in (21.3): 


(1% ИН E || (21.4) 


Equation (21.4) represents x — 4y = 0, which is solved by letting y = f, therefore: 


which is represented by: 


Therefore, with т = 1, the eigenvector is: 


H 
m H Е В (21.5) 


Equation (21.5) represents algebraically: 


Next, we substitute А = 1: 


4x+4y=0 
x+y=0 


which is simplified by subtracting 4 times the bottom row to the top row in (21.5): 


il H " В (21.6) 


Equation (21.6) represents x + у = 0, which is solved by letting у = 1, therefore: 


which is represented by: 
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Therefore, with т = 1, the eigenvector is: 1 


Therefore, the matrix : 2| has eigenvectors ЕМ апа B , with associated 


eigenvalues of A = 1 and à = 6. 
A square matrix can be any size, with zero or any number of eigenvectors, and 
strategies for solving such problems are often found on the internet. 


21.3 Complex Vectors 


A characteristic equation may have real or complex solutions, and if they are complex, 
there are no real eigenvectors. For example, we would not expect the following 
rotation transform to have any real eigenvectors, as this would imply that it shows 
a rotational preference to certain points. Let's explore this transform to see how the 
characteristic equation behaves. 


R= Ее А 


~ | sing cos В 
The characteristic equation is: 


с0$(В) — А -—sinf 
sinf  cos(8) — X 


p 
where В is the angle of rotation. Therefore: 


(соз(В) — А)? + sin? 8 = 0 
А2 — 2A cos B + cos? B + sin? В = 0 
A? —2rcosB+1=0 


This quadratic in A is solved using: 


a —b + Jb? — 4ac 
Е 2а 


where a = 1, b = –2с05 8, с= 1: 
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№ = cos В + isin В 


А2 = cos В — isin В 
The corresponding complex eigenvectors are: 


fl 
- 


The major problem with the above technique is that it requires careful analysis to 
untangle the eigenvector, and ideally, we require a deterministic algorithm to reveal 
the result. 


21.4 Summary 


Hopefully, the above examples have shown you the existence of eigenvectors, and 
their associated eigenvalue. 


21.5 Worked Example 


21.5.1 Eigenvectors and Eigenvalues 


Find the eigenvectors and eigenvalues for the transform using the characteristic equa- 


tion: 
12 
43 


and show that the eigenvectors can be transformed and scaled by the eigenvalue. 


Solution 
Using the characteristic equation we have: 


(1-2)3-2)-8-0 
3-4.-22-8-0 
2-4.-5-0 

(А+ 1)(А – 5) = 0 


therefore А = —1 anda = 5. 
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Next, we substitute the two values of A in: 


КЕРІНЕНЕ 


to extract the eigenvectors. Let's start with А = —1: 
22||х 0 
ОНЫН aun 
2x+2y=0 
4x+4y = 0 


which is simplified by subtracting twice the first row from the second row in (21.7): 


[oo] В 7 В (21.8) 


Let у = t, therefore x = —t, which is represented by: 
xi _ |=| ? —1 
yl | {|7 1 
1 
Next, we substitute A = 5: 
-4 2 x 0 
Fb] Lo _ 


—4х+2у=0 
4x —2y 20 


Thus the eigenvector is: 


which is simplified by adding the first row to the second row in (21.9): 


9121-10 
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Let у = t, therefore x = 1/2, which is represented by: 
Е ЕЙ ү | 
= =f 
y t 1 


1 
There are two vectors that satisfy these requirements: 


Ld 


and they are eigenvectors with an associated eigenvalue of — 1 and 5. 


Thus the eigenvector is: 


Test to see if the eigenvectors are scaled by the eigenvalue, and transformed. 
12||-1| | 1 as 1| |1 
43 1| |-1J|' 1| |-1 
12||0.55|) | 2.5 5 05|_ |25 
43 1] ] 5}? 1} | 5 


Chapter 22 A) 
Calculus: Derivatives б 


22.1 Introduction 


This chapter starts differential calculus, and begins with the origin of a function’s 
derivative, and the idea of a limiting value. The chapter continues by showing how 
the derivative is calculated for different functions, and concludes with several worked 
examples. 


22.2 Background 


Although the principles of calculus were being used by Archimedes (287-212 B.C.) 
to compute areas, volumes and centres of gravity, it was the English astronomer, 
physicist and mathematician Isaac Newton (1643-1727) and Gottfried Leibniz who 
are regarded as the true inventors of modern calculus. Leibniz published his results 
in 1684, followed by Newton in 1704. However, Newton had been using his calculus 
of fluxions as early as 1665. Since then, calculus has evolved conceptually and in 
notation. 

Up until recently, calculus was described using infinitesimals, which are num- 
bers so small, they can be ignored in certain products. However, infinitesimals, no 
matter how small they are, do not belong to an axiomatic mathematical system, 
and eventually the French mathematician Augustin-Louis Cauchy, and the German 
mathematician Karl Weierstrass (1815-1897), showed how they could be replaced 
by limits. 


22.3 Infinitesimals 


The adjective small is a relative term, and requires clarification in the context of 
numbers. For example, if numbers are in the hundreds, and also contain some decimal 
component, then for any quick calculation, one can ignore digits after the 3rd decimal 
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place. For instance: 
100.000003 x 200.000006 ~ 20, 000 


where the decimal part has no significant impact on the general accuracy of the 
answer, which is measured in tens of thousands. 

Let's divide a number into two parts: a primary part x, and an infinitesimal part ôx, 
where x = 100 and ôx = 0.000003 (pronounced ‘delta x’). Given two such numbers, 
хі and уг, their product is given by: 


х= х + х 
yı = у + dy 
хуу, = (х + 0x)Cy + бу) 
= xy + xóy + удх + dxdy 


Using x; = 100.000003 and y; = 200.000006 we have: 


хіуі = 100 x 200 + 100 x 0.000006 + 200 x 0.000003 + 0.000003 х 0.000006 
= 20, 000 + 0.0006 + 0.0006 + 0.00000000018 
= 20, 000 + 0.0012 + 0.00000000018 
= 20, 000.00120000018 


where it is clear that the products хбу, yóx and дхду contribute very little to the 
result. Furthermore, the smaller we make 6x and dy, their contribution becomes 
even more insignificant. Just imagine if we reduce ôx and ôy to the level of quantum 
phenomenon, e.g. 10-34, then their products play no part in every-day numbers. But 
there is no need to stop there, we can make óx and óy as small as we like, e.g. 
1 07 100000000000. Т ater on we employ the device of reducing a number towards zero, 
such that any products involving them can be dropped from any calculation. 

Even though the product of two numbers less than zero is an even smaller number, 
care must be taken with their quotients. For example, in the above scenario, where 
бу = 0.000006 and дх = 0.000003: 


бу _ 0.000006 _ 


ôx 0.000003 — 


so we must watch out for such quotients. 

Differential calculus is concerned with the rate at which a function changes relative 
to one of its independent variables, and employs the ratio 6у/бх to compute this 
value. The limiting value of this ratio is called the function’s derivative, and we 
will explore two ways of computing it, and provide a graphical interpretation of the 
process. The first method uses simple algebraic equations, and the second uses a 
functional representation. Needless to say, they both give the same result. 


22.4 Equations and Limits 451 


22.4 Equations and Limits 


22.4.1 Quadratic Function 


Here is a simple algebraic approach using limits to compute the derivative of a 
quadratic function. Starting with the function y = x?, let x change by ôx, and let бу 
be the corresponding change in y. No values are assigned to x, ôx or dy, at this stage, 
itis an algebraic argument. We then have: 
у-х 
y+ by = (x + dx)? 
= х? + 2x -ôx + (8х)? 
dy = 2x - dx + (6х)? 


Dividing throughout by ôx we have: 


6 

= 2х + dx 

ôx 

The ratio ôy /ôx provides a measure of how fast y changes relative to x, in increments 
of ôx. For example, when x = 10: 


5 
2Y 20+ 8x 
ôx 


and if ôx = 1, then dy/dx = 21. Equally, if 6x = 0.001, then ôy/ôx = 20.001. By 
making дх smaller and smaller, бу becomes equally smaller, and their ratio converges 


towards a limiting value of 20. 
In this case, as ôx approaches zero, ду/дх approaches 2x, which is written: 


Thus in the limit, when ôx = 0, we create a condition where ду is divided by zero— 
which is a meaningless operation. However, if we hold onto the idea of a limit, as 
óx — 0, it is obvious that the quotient A is converging towards 2x. The subterfuge 


employed to avoid dividing by zero is to substitute 2 to stand for the limiting 
condition: 
dy _. бу 
— = lm --- 


= 2х 
ах 6x0 ôx 


ay (pronounced dee y dee x) is the derivative of у = x? 


, le. 2x. For instance, when 
x =0, dy — 0, and when x = 3, 4% — 6. The derivative ду, is the instantaneous rate 
dx dx dx 


at which y changes relative to x. 


452 22 Calculus: Derivatives 


If we had represented this equation as a function: 


/@) = х? 
Je x 


where f'(x) is another way of writing Ч. 
Now let’s introduce two constants into the original quadratic equation to see what 
effect, if any, they have on the derivative. We begin with: 
у= ах? +b 


and increment x and у: 


y+ ду = а(х + 8х)? +b 
= a(x? + 2хёх + (6х)?) +b 
бу =a (2xóx + (8x)?) 


dividing throughout by 6x: 


бу 

— = а(2х + ôx) 

бх 

and the derivative is: 
dy бу 
— = m -- 
ах ӛх->0 6x 
= 2ax 


Thus the added constant b disappears (i.e. because it does not change), whilst the 
multiplied constant а is transmitted through to the derivative. 


22.4.2 Cubic Equation 


Now let's repeat the above analysis for the cubic equation у = x°: 


у= х) 


у + dy = (х + ôx)? 
= x? + 3x78x + 3x(dx)* + (8x? 
dy = 3x7dx + 3x (8x)? + (8x? 
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Dividing throughout by ôx: 
ô 
Z = 3x? + 3хбх + (x)? 
ôx 


Using limits, we have: 


or 


dy ‚ бу 

= lm — 
dx ӛх->0 6x 
= 3x7 


We could also show that if y = ax? + b then: 


e» = Зах? 
dx 
This incremental technique can be used to compute the derivative of all sorts of 
functions. 
If wecontinue computing the derivatives of higher-order polynomials, we discover 
the following pattern: 


у= х2, Zox 
yox, E 
у= х“, ^ cap 
y=x, 3 s 


Clearly, the rule is: 


d 
ym Z cnt 
x 


but we need to prove why this is so. The solution is found in the binomial expansion 
for (x + 6x)", which can be divided into three components: 


1. Decreasing terms of x. 
2. Increasing terms of бх. 
3. The terms of Pascal’s triangle. 
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For example, the individual terms of (x + 5x)* are: 


Decreasing terms of x: x^ x3 x? xl x9 
Increasing terms of dx: (8х)0 (6х)! (8x)? (8x)? (8x)* 
The terms of Pascal’s triangle: 1 4 6 4 1 


which combined, produce: 
x^ + 4x3 (5x) + 6x? (6х)? + 4х(8х)? + (8x)* 


Thus when we begin an incremental analysis: 


у= х* 


у + бу = (x + ôx) 
= x44 Ax? (8x) + 6x? (6х)? + 4х(8х)? + (8x)* 
dy = Ax (8x) + 6x7 (6х)? + Ax (8x)? + (x)* 


Dividing throughout by ôx: 
8 
m = 4х3 + 6x? (8x)! + Ax (8x)? + (8x? 
X 


In the limit, as óx slides to zero, only the second term of the original binomial 
expansion remains: 4x?. The second term of the binomial expansion (1 + 5x)” is 
always of the form: nx"-!, which is the proof we require. 


22.4.3 Functions and Limits 


In order to generalise the above findings, let's approach the above analysis using 
a function of the form y = f(x). We begin by noting some arbitrary value of its 
independent variable and note the function's value. In general terms, this is x and 
f (x) respectively. We then increase x by a small amount 6x, to give x + óx, and 
measure the function's value again: f(x + 8х). The function's change in value is 
f (x + 6x) — f (x), whilst the change in the independent variable is 6x. The quotient 
of these two quantities approximates to the function's rate of change at x: 


Жо + х) — f(x) 
ôx 


(22.1) 


By making ôx smaller and smaller towards zero, (22.1) converges towards a limiting 
value expressed as: 


GY gg AE a (22.2) 
dx | 8x20 ôx 
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which can be used to compute all sorts of functions. For example, to compute the 
derivative of sin x we proceed as follows: 


у = sinx 
y + ду = sin(x + dx) 


Using the identity sin(A + В) = sin А - cos В + cos А - sin B, we have: 


y + dy = sin x - соѕ(дх) + cos x - sin(dx) 
бу = sin x - cos(óx) + cos x - sin(óx) — sin x 


= sin х(соѕ(дх) — 1) + cos x - sin(6x) 
Dividing throughout by ôx we have: 


бу sin x sin(dx) 
— = (соѕ(8х) — 1) + 
óx Ox ôx 


cos x 


In the limit as ôx — 0, (cos(óx) — 1) —> О and sin(ôx)/ôx = 1, (See Appendix А) 


and: | 
dy _ d(sin x) _ 


dx ах 


COS X 


Before moving on, let's compute the derivative of cos x: 


у = cos x 


у + ду = cos(x + 6x) 
Using the identity cos(A + B) = cos A - cos В — sin А - sin В, we have: 


у + dy = cos x - с0$(6х) — sin x - sin(Óx) 
бу = cos x · cos(óx) — sin x - sin(6x) — cos x 


= cos x(cos(óx) — 1) — sin x - sin(Óx) 
Dividing throughout by ôx we have: 


бу | cosx sin(dx) 


522 (cos(óx) — 1) — sin x 


In the limit as ôx — 0, (cos(óx) — 1) — О and віп(бх)/6х = 1 (See Appendix А), 


and: 
dy  d(cosx) _ 


dx dx 


sin x 


We will continue to employ this strategy to compute the derivatives of other functions 
later on. 
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22.4.4 Graphical Interpretation of the Derivative 


For a graphical interpretation of the derivative, consider the scenario in Fig. 
22.1 where the sample point is P. In this case the function is f(x) = x? and P’s 
coordinates аге (x, x?). We identify another point R, displaced ёх to the right of P, 
with coordinates (x + ôx, x?). The point О on the curve, vertically above №, has 
coordinates (x + х, (x + дх)?). When 6x is relatively small, the slope of the line 
РО approximates to the function’s rate of change at P, which is the graph’s slope. 
This is given by: 


ОК (x + bx)? — x? 
PR 5x 
x? + 2x (6х) + (6х)? — x? 
ôx 
2x(8x) + (6x)? 
n ôx 
= 2x + ôx 


slope = 


We can now reason that as ôx is made smaller and smaller, Q approaches P, and 
slope becomes the graph’s slope at P. This is the limiting condition: 


dy _ 


тасты 


Thus, for any point with coordinates (x, х2), the slope is given by 2x. For example, 
when x = 0, the slope is 0, and when x = 4, the slope is 8, etc. 


Fig. 22.1 Sketch of 


Ту YA Q у 
4 
/ 
| и | 
Р p /” 
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22.4.5 Derivatives and Differentials 


Given a function f(x), ae represents the instantaneous change of f for some x, and 
is called the first derivative of f(x). For linear functions, this is constant, for other 
functions, the derivative’s value changes with x and is represented by a function. 

The elements df, dy and dx are called differentials, and historically, the deriva- 
tive used to be called the differential coefficient, but has now been dropped in favour 
of derivative. One can see how the idea of a differential coefficient arose if we write, 
for example: 


dy 
DAD 
dx * 
as 
dy = 3x dx 


In this case, 3x acts like a coefficient of dx. However, today, the derivative is regarded 
as an operator, and even though it is written: 


it really means: 


d 
where the operator 7- acts on y. 


22.4.6 Integration and Antiderivatives 


If it is possible to differentiate a function, it seems reasonable to assume the exis- 
tence of an inverse operator which turns the derivative back to the original function. 
Fortunately, this is the case, but there are some limitations. This inverse process is 
called integration and reveals the antiderivative of a function. Many functions can 
be paired together in the form of a derivative and an antiderivative, such as 2x with 
x?, and cos x with sin x. However, there are many functions where it is impossible 
to derive its antiderivative in a precise form. For example, there is no simple, finite 
functional antiderivative for sin(x?) or (sin x) /x. To understand integration, let's 
begin with a simple derivative. 
If we have: 


d 
«У = 18x? — 8x +8 
dx 
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it is not too difficult to reason that the original function could have been: 
у = 6x? — Ax? + 8х 
However, it could have also been: 
у = 6x? — 4х2 + 8x +2 


or 
у = 6x? — Ax? + 8х + 20 


or with any other constant. Consequently, the integration process has to include an 
arbitrary constant C: 
у = бх? — 4х2 + 8x + C 
The value of C is not always required, but it can be determined if we are given some 
extra information, such as y = 10 when x = 0, then C = 10. 
Given a function: 
у = 6x? — Ax? + 8x + 10 
its derivative is written: 


d 
—y = 18x? — 8x +8 
dx 


The antiderivative of 18x? — 8x + 8 reveals the original function, and is written: 
y= f 08? -8x mas 

although brackets are not always used: 
y= f ise -8x +8 ax 


This equation reads: ‘y equals the integral of 18x? — 8x + 8 dee x.’ The dx reminds 
us that x is the independent variable. In this case we can write the answer: 


y= f ise -8x +8 ax 
= бх? — 4х? + 8x + С 


where C is some constant. 
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The antiderivatives for the sine and cosine functions are written: 
J dx = — cosx + C 


| 5» dx = sinx +С 


which you may think obvious, as we have just computed their derivatives. How- 
ever, the reason for introducing integration alongside differentiation, is to make you 
familiar with the notation, and memorise the two distinct processes, as well as lay 
the foundations for the next chapter. 


22.5 Function Types 


Mathematical functions come in all sorts of shapes and sizes. Sometimes they are 
described explicitly where y equals some function of its independent variable(s), 
such as: 

y=xsinx 


or implicitly where y, and its independent variable(s) are part of an equation, such 
as: 
x? у? = 10 


A function may reference other functions, such as: 
cue 2 
у = sin(cos^ x) 


or 


sin x 


There is no limit to the way functions can be combined, which makes it impossible 
to cover every eventuality. Nevertheless, we will explore some useful combinations 
that prepare us for any future surprises. 

First, we examine how to differentiate different function types, that include sums, 
products and quotients, which are employed later on to differentiate specific functions 
such as trigonometric, logarithmic and hyperbolic. Where relevant, I include the 
appropriate antiderivative to complement its derivative. 


22.6 Differentiating Groups of Functions 


So far, we have only considered simple individual functions, which unfortunately, do 
not represent the equations found in mathematics, science, physics or even computer 
science. In general, the functions we have to differentiate include sums of functions, 


460 22 Calculus: Derivatives 


functions of functions, function products and function quotients. Let’s explore these 
four scenarios. 


22.6.1 Sums of Functions 


A function normally computes a numerical value from its independent variable(s), 
and if it can be differentiated, its derivative generates another function with the same 
independent variable. Consequently, if a function contains two functions of x, such 
as u and v, where: 

y = u(x) + v(x) 


which can be abbreviated to: 


y=u+v 
then: 
dy du dv 
dx dx dx 


where we just sum their individual derivatives. For example: find n. given: 


u = 2х6 

у = 3x? 

у=и-у 

y = 2х6 + 3x9 

Differentiating у: 

dy 5 4 
— = 12х° + 15x 
dx 


Figure 22.2 shows a graph of у = 2х6 + зшх + созх and its derivative, ау = 
12x? + cos x — sin x. Differentiating such functions is relatively easy, so too, is inte- 
grating. Given: 


dy du ау 


dx ах dx 


then: 


22.6 Differentiating Groups of Functions 461 


Fig. 22.2 Graph of 


у = 2х6 + sin x + cos х ЗА I 
(blue) and its derivative, 
a = 12x5 + cos x — sin x 2 
(green) — 
N 
1 0 1 >. 


For example, given: 
d 
= 12x? + cos x — sin x 
dx 


we find y by integrating: 


y= | 12х°х+ f cosxax— | sinx as 


= 2х6 + sin x + cosx + C 


22.6.2 Function of a Function 


One of the advantages of modern mathematical notation is that it lends itself to 
unlimited elaboration without introducing any new symbols. For example, the poly- 
nomial 3x? + 2x is easily raised to some power by adding brackets and an appropriate 
index: (3x? + 2x)*. Such an object is a function of a function, because the function 
3x? + 2x is subjected to a further squaring function. The question now is: how аге 
such functions differentiated? Well, the answer is relatively easy, but does introduce 
some new ideas. 

Imagine that person A swims twice as fast as person B, who in turn, swims three 
times as fast as person C. It should be obvious that person A swims six (2 x 3) 
times faster than person C. This product rule, also applies to derivatives, because if 
у changes twice as fast as и, i.e. D = 2, and u changes three times as fast as x, i.e. 
du = 3, then y changes six times as fast as x: 

dy dy du 
dx du dx 


To differentiate: 
y= (3x? + 2х)? 
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we substitute: 


и = Зх? + 2х 
then: 
у=? 
апа 
а 
xad —2u 
du 
= 2(3х2 + 2x) 
= 6х2 + 4x 
Next, we require du, 
u = 3х2 + 2х 
OH dud 
DU. = '6x 
dx 
therefore, we can write: 
dy ау du 
dx du dx 


= (6x? + Ax)(6x + 2) 
= 36x? + 36x? + 8x 


This result is easily verified by expanding the original polynomial and differentiating: 


y= (3х2 + 2х)? 
= (3x? + 2х)(3х2 + 2x) 
= 9х* + 12х3 + Ax? 


d 
СУ = 36x? + 36x? + 8x 
dx 


Figure 22.3 shows a graph of у = (3x? + 2x)? and its derivative, a = 36х3 + 
36х? + 8х. 


у = sin(ax) is a function of a function, and is differentiated as follows: 


y = sin(ax) 
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Fig. 22.3 Graph of 
у = (3х2 + 2x)? (blue) and \ 
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its derivative, 
® = 36x? + 36x? + 8x 
(green) 


У 


Substitute и for ax: 


dy 
du 


Next, we require du: 
Ж 


therefore, we can write: 


dy ау du 
dx du dx 
= СО5(ах)-а 


= acos(ax) 
Consequently, given: 


ay (ax) 
— = COS 
dx ax 


then: 


y= fosa dx 


1 
= —sin(ax) + C 
a 
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Similarly, given: 


2% = sin(ax) 


dx 


then: 


y= [ sin(ax) dx 


1 
= ——cos(ax) + C 
а 
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The equation y = sin(x?) is also a function of a function, and is differentiated as 


follows: 
y = sin(x?) 
Substitute и for x?: 
у = ши 
y 
— = cosu 
du 
= cos(x?) 


du. 


Next, we require "EE 


и = х? 


du = 2% 
dx 
therefore, we can write: 
dy dy du 
dx du ах 
= cos(x?) -2х 


= 2x cos(x?) 


Figure 22.4 shows a graph of y — sin(x?) and its derivative, dy — 2x cos(x?). In 


dx 


general, there can be any depth of functions within a function, which permits us to 


write the chain rule for derivatives: 


dy dy du dv dw 


dx du dv dw dx 


22.6 Differentiating Groups of Functions 465 


Fig. 22.4 Graph of 


y 
y — sin x? (blue) and its 8А 
derivative, йу = 2x cos(x?) 
(green) 


| 
мж. ШЕШ E 


22.6.3 Function Products 


Function products occur frequently in every-day mathematics, and involve the prod- 
uct of two, or more functions. Here are three simple examples: 


у = (3х2 + 2x) (2x? + Зх) 
у = sin x - cos x 


y=x sinx 


When it comes to differentiating function products of the form: 


y=uv 
it seems natural to assume that: 
dy du dv 
ALTE eo ан 22.3 
dx dx dx ( ) 


which unfortunately, is incorrect. For example, in the case of: 
у = (3х2 + 2x) (2x7 + Зх) 


differentiating using (22.3) produces: 


d 
СУ L (6x 3x23) 
ах 


= 24x? + 26x +6 
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However, if we expand the original product and then differentiate, we obtain: 


у = (3х2 + 2x) (2х? + 3x) 
= бх“ + 13x? + 6х2 


d 
СУ _ 24x3 + 39x? + 12x 
dx 


which is correct, but differs from the first result. Obviously, (22.3) must be wrong. 
So let’s return to first principles and discover the correct rule. 

So far, we have incremented the independent variable—normally x—by ôx to 
discover the change in y—normally dy. Next, we see how the same notation can be 
used to increment functions. 

Given the following functions of x, и and v, where: 


y=uyv 


If x increases by 5x, then there will be corresponding changes of ди, dv and dy, in 
u, v and y respectively. Therefore: 


y+ ду = (и + ди) (у + ду) 
= иу + иду + уди + диду 
бу = udv + vóu + dudv 


Dividing throughout by 6x we have: 


ду 
ôx 


_ uum LL óv 
TE ЕР "Sy 


In the limiting condition: 


dy : óv : óu . бу 
— = lm [и] lim |у- |+ lim [óu— 
dx бх->0 ôx 5x0 ôx 5x30 ôx 


As ôx — 0, then би — 0 and (8и ov) — 0. Therefore: 


= и +y (22.4) 
Using (22.4) for the original function product: 


и = Зх? + 2х 
у= 2х2 + Зх 


у = иу 
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du 

— —6 2 

dx oe 

dv 

— = 4 3 

dx кт 

dy dv du 
=u Tv 


dx dx dx 


= (3x? + 2х) (4х + 3) + Qx? + 3x) (6x + 2) 
= (12x? + 17x? + 6x) + (12x? + 22x? + бх) 


= 24x? + 39x? + 12x 
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which agrees with our previous prediction. Figure 22.5 shows the graph of y = 
(3x? + 2x)(2x? + 3x) and its derivative d» — 24x? + 39x? + 12x. 


> dx 


The equation у = sin x - cosx contains the product of two functions and is dif- 


ferentiated using (22.4) as follows: 


y-sinx-cosx 


u — sinx 
du 
— = cosx 
dx 
v = cosx 
dv А 
— = —sinx 
dx 
dy dv du 


dx dx m dx 


= sin x(— sin x) + cos x · cosx 


— cos? x — sin? x 
— cos(2x) 
Fig.22.5 Graph of y | 
y = (3х2 + 2x)(2x? + Зх) | А | | 
(blue), and its derivative, | 5 | 
ДУ = 24х3 + 39x? + 12x | | 
(green) | 2 Г 
Ау 
\ | p» 
5 | j б 4 
1-2 i 
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Fig. 22.6 Graph of 


2 У 
у = sin x - cos x (blue), and A 
its derivative, йу = cos(2x) 1 
(green) 
п п 0 T п X 


Using the identity sin(2x) = 2 sin х - cos x, we can rewrite the original function as: 
y-sinx-cosx 
Is Qx) 
= = 511(2х 
2 


ау 
— = 2 
di cos(2x) 


which confirms the above derivative. Now let's consider the antiderivative of cos(2x). 


Given: 


dy 
— = 2 
FE cos(2x) 


then: 


y- [coven dx 


1 
= sin(2x) + C 


= sin х: cosx + C 


Figure 22.6 shows the graph of y = sin x - cos x and its derivative, 2 — cos(2x). 


22.6.4 Function Quotients 
Next, we investigate how to differentiate function quotients. We begin with two 
functions of x, u and v, where: 
u 
M 
у 


which makes y also a function of x. 
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We now increment x by óx and measure the change in и as би, and the change in 
v as dv. Consequently, the change in y is dy: 


и + ди 


vd ду 
и + ди и 


v+év v 
v(u + ди) — u(v + ду) 
v(v + ду) 
уди — иду 
= v(v + бу) 


у+бу= 


бу 


Dividing throughout by ôx we have: 


бх v(v + ду) 


As 6x — 0, би, dv апа бу also tend towards zero, and the limiting conditions are: 


dy . ôy 
— = lim — 
dx ӛх->0 дх 
du н би 
—— = hm ERE 
Уах ӛх->0 pr 
dv . бу 
= Ши и- 


и — 
ах 6x30 ôx 


25-0 
v= jim, v(v + ду) 


therefore: 
du dv 


dy “ах “dx 
dx y 


As an example, let's differentiate: 


Ox 2x5 +3х +6 
х2 +3 


Substitute и = x? + 2x? + 3x + 6 and v = x? + 3, then: 


du 
dx 
ау 
ах 


= 3х? +4х +3 


= 2x 
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Fig. 22.7 Graph of y = 


y 2 
(x? + 3)(x + 2)/(x? + 3) A 
(blue), and its derivative, 2- 
dy _ | (green) Е 
ах“ 5 Ж 
4 
x 
4 3 = 4 9 1 2 у 4” 
P d 
P d 
ж К 
Ед 
ut f 


dy _ (x? +. 3)x? + 4x +3) — (x? + 2х? + 3x + 6) (2х) 


dx (x? + 3)? 
(3x4 + 4x? + 3х? + 9х? + 12x + 9) — (2x4 + 4x? + 6x? + 12x) 
= х4--6х2--9 
x* + 6x? +9 
~ x44 6x2 +9 
=1 


which is not а surprising result when one sees that the original function has the 
factors: " 
3 2 
_@+D@4+2 |, 
x? +43 


whose derivative is 1. Figure 22.7 shows a graph of y = (х2 + 3)(х + 29 (x? + 3) 
and its derivative, ay zu 
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Simple functions fall conveniently into two types: explicit and implicit. An explicit 
function, describes a function in terms of its independent variable(s), such as 


y=asinx + b cosx 
where the value of y is determined by the values of a, b and x. On the other hand, 
an implicit function, such as: 
х? + у? = 25 


combines the function’s name with its definition. In this case, it is easy to untangle 


the explicit form: 
y= у25 – x? 
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So far, we have only considered differentiating explicit functions, so now let’s exam- 
ine how to differentiate implicit functions. Let’s begin with a simple explicit function 
and differentiate it as it is converted into its implicit form. 
Let: 
у = 2х2 + 3х +4 


then: 4 
y 
—=4 3. 
dx vm 


Now let's start the conversion into the implicit form by bringing the constant 4 over 
to the left-hand side: 
y—4= 2х? + 3x 


Differentiating both sides: 
dy 
— = 4х +3 
dx 


Bringing 4 and 3x across to the left-hand side: 
у= 3x —4 = 2x? 


Differentiating both sides: 


Finally, we have: 


Differentiating both sides: 


dy 
— —4x-320 
dx " 


dy 
— = 4x +3 
dx к, 


which seems straight forward. The reason for working through this example is to 
remind us that when y is differentiated we get an Let’s differentiate these two 
examples: 


y+sinx + 4х 20 and y+x*—cosx = 0 
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Differentiating the individual terms: 
y+sinx + 4х = 0 


а 
ао) 


ах 
а 
ЕР ee 
ах 

у + х2 — соѕх = 0 

а 

BT Lord rc 

dx 
d 
Le eer 
dx 


But how do we differentiate y? + x? = r?? Well, the important difference between 
this implicit function and previous functions, is that it involves a function of a func- 
tion. y is not only a function of x, but is squared, which means that we must employ 
the chain rule described earlier: 


dy dy du 
dx du dx 
Therefore, given: 
Ух? = 2 
dy 
2y—+2x = 
Уах Tum 
dy | —2x 
dx 2y 
PEE. 
Ev 


This is readily confirmed by expressing the original function in its explicit form and 
differentiating: 
у= (72 — х2)? 


which is a function of a function. 
Let u = r? — x?, then: 
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As у = из, then: 


dy 1 
аи 2 


However: 


dy ау du 
dx аи dx 


PUES) 


which agrees with the implicit differentiated form. 

22.8 Differentiating Exponential and Logarithmic 
Functions 

22.8.1 Exponential Functions 


Exponential functions have the form y — a*, where the independent variable is the 
exponent. Such functions are used to describe various forms of growth or decay, from 
the compound interest law, to the rate at which a cup of tea cools down. One special 
value of a = 2.718282.., called e, where: 


1 n 
e= lim (1+ 2) 
поо п 


1 nx 
е" = lim (1 + :) 
поо п 


which, using the Binomial Theorem, is: 


Raising e to the power x: 
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If we let: 
у= е 
dy 1+ na um 
c NE a r E 
dx 2! 3! 4! 
= е* 


X 


which is itself. Figure 22.8 shows graphs of y = e* and y=e™. 
Now let's differentiate y — a*. We know from the rules of logarithms that: 


log x" 2 nlogx 


therefore, given: 


ya 
then: 
In y =Ina* = xlna 
therefore: 
y ех Ina 
which means that: 
а“ ех Ina 
Consequently: 
а x — ех Ina 
dx dx 


Fig. 22.8 Graphs of y = e* 
andy =e * \ 


>< 
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= ehana 
= a`“ ша 
It can Бе shown that: 

god hus 
spe = 
е Ет 
—— 
yd, Э = а* ша 
уа; і --а “Ша 


The exponential antiderivatives are written: 
[ е dx =e" +С 


jus dx =-e*+C 


22.8.2 Logarithmic Functions 


Given a function of the form: 
y=Inx 


then: 
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Therefore: 
dx = e 
dy 
LX 
dy 1 
ах x 
Thus: 
d 
— nx = — 
dx 
Figure 22.9 shows the graph of y — In x and its derivative, Ч = 1. Conversely: 


1 
[ imc 
x 
When differentiating logarithms to a base a, we employ the conversion formula: 


у = log, x 
= (In x) (log, e) 


whose derivative is: 
dy 1 | 
— = —1ое е. 
ах x ба 


When а = 10, then 102106 © 0.4343... and: 


d | 0.4343 
— 108) X % 
dx 9810 
Fi = i ivative, 9 — 0.4343 
igure 22.10 shows the graph of у = logy, x and its derivative, 77 = ———. 
Fig. 22.9 Graph of y = Inx y 
(blue), and its derivative, A \ [== 
2 = 1 (green) = | 
1 
| 1 
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Fig. 22.10 Graph of 


у = log) x (blue), and its УА 
derivative, 4у А 04543 | 
(green) Р 
ШЕШЕН 
EE T2 2 3 4 5 6 x 


22.0 Differentiating Trigonometric Functions 


We have only differentiated two trigonometric functions: sin x and cos x, so let's add 
tan x, csc x, sec x and cot x to the list, as well as their inverse forms. 


22.9.1 Differentiating tan 


Rather than return to first principles and start incrementing x by ёх, we can employ 
the rules for differentiating different function combinations and various trigonometric 
identities. In the case of tan(ax), this can be written as: 


tan(ax) = И 
соѕ(ах) 
апа employ the quotient rule: 
du dv 
dy - “dx " dx 
dx v2 


Therefore, let и = sin(ax) and v = cos(ax), and: 


acos(ax) · cos(ax) + a sin(ax) - sin(ax) 


d 
= = 
dx pata) cos? (ах) 


_ a(cos?(ax) + sin? (ax)) 


cos? (ax) 


a 
^ соѕ2(ах) 
= a sec? (ax) 


= а(1 + tan? (ax)) 
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Fig. 22.11 Graph of y 
у = tan x (blue), and its | |] | A | | \ E 
derivative, 4у = 1+ tan? x \ | | | И | | | | | 
(green) | | | 
1 р 
е ; Z > 
E: | Fo 3 
| | 2 | 
Figure 22.11 shows a graph of y = tan x and its derivative, ау = 1 + tan? x. 
It follows that: 1 
| зес? (ах) dx = — tan(ax) + С 
а 
22.9.2 Differentiating csc 
Using the quotient rule: 
у = csc(ax) 
_ я 
~~ sin(ax) 
d 0 — acos(ax) 
—— свс(ах) = —_,—— 
dx sin“ (ax) 
_ —acos(ax) 
~ sin'(ax) 
а cos(ax) 
sin(ax) sin(ax) 

= —acsc(ax) - cot(ax) 

Figure 22.12 shows a graph of у = csc x and its derivative, ЧУ = —cscx-cotx. 


It follows that: 1 
csc(ax) - cot(ax) dx = -- csc(ax) + C 
a 


22.9 Differentiating Trigonometric Functions 


Fig. 22.12 Graph of | | 
у = csc x (blue), and its | 
у 


derivative, йу = | | 
— csc x - cot x (green) | | 


22.9.3 Differentiating sec 


Using the quotient rule: 


y — sec(ax) 
1 
cos(ax) 
d —(—а sin(ax)) 
— sec(ax) = — 
dx cos? (ax) 


a sin(ax) 


cos? (ах) 
a sin(ax) 


cos(ax) cos(ax) 


= asec(ax) - tan(ax) 
Figure 22.13 shows a graph of y = sec x and its derivative, ay = sec x ·їап х. 


It follows that: 


1 
sec(ax) - tan(ax) dx = — sec(ax) + C 
a 


22.9.4 Differentiating cot 


Using the quotient rule: 


— cot(ax) 
1 


tan(ax) 


| 
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Fig. 22.13 Graph of 


22 Calculus: Derivatives 


oe 11 и ||| 
n Uli VAL i 
"ES ГА 

А Af? 
ПЕН 


у = cot x (blue), and its 
derivative, = 
x 


—(1 + cot? x) (green) 


Figure 22.14 shows a graph of y = cot x and its derivative 


It follows that: 


d 
— cot = 
d | tan? (ax) 


a 


—a sec? (ах) 


cos? (ах) 


© cos?(ax) sin? (ax) 


a 


^o sim (ax) 


= —a csc? (ax) 


= —a(1 + cot? (ax)) 


| a = —(1 + cot? x). 


1 
| 1+ cot? (ax) dx = —- cot(ax) + C 
a 
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22.9.5 Differentiating arcsin, arccos and arctan 


These inverse functions are solved using a clever strategy. 
Let: 


x =siny 


then: 
у = arcsin x 


Differentiating the first expression, we have: 


dx 

dp my 
dy 1 
dx | COS y 


As sin? y 4- cos? y — 1, then: 


cosy = 4/ 1 — sin? y = y 1 — x? 


and 1 
-- arcsin x = ———— 
qu л: 
Using a similar technique, it can be shown that: 
1 
— arccos x = — —————— 
dx VT = x? 
d 1 
-- arctanx = 
dx 14x? 


It follows that: 
[ inx +С 
————— = arcsinx 
V1 — x? 
[ ы +С 
———— = arccos х 
V1 — x? 
dx 
[ Tm, — arctan x +C 
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22.9.6 Differentiating arccsc, arcsec and arccot 


Let: 
у = arccsc x 
then: 
X = CSC у 
" 1 
^ siny 
dx _ —cosy 
dy 7 sin? y 
dy  —sin?y 
dx COS y 
B 1 X 
х2,/х2-1 
1 
— arccsc x = — 
хү/х2-і1 
Similarly: 
d 1 
— arcsec х = 
dx хү/х2-1 
а 1 
— arccot x = — 
dx х2--1 
It follows: 


dx 
—— = arcsec |x| + С 
| хух2 — 1 А! 


| ах 
—— —— = — arccot x + C 
х +1 


22.10 Differentiating Hyperbolic Functions 


Trigonometric functions are useful for parametric, circular motion, whereas, hyper- 
bolic functions arise in equations for the absorption of light, mechanics and in integral 
calculus. Figure 22.15 shows graphs of the unit circle and a hyperbola whose respec- 
tive equations are: 


22.10 Differentiating Hyperbolic Functions 483 


Fig. 22.15 Graphs of the y 
unit circle x? + у? = 1 and А 
the hyperbola x? — y? = 1 » 


2 
Y^ Y 
-4 -8 -p NTA 2 3 iT 
iz 


yt 
x+y =1 
x-y =l 


where the only difference between them is a sign. The parametric form for the 
trigonometric, or circular functions and the hyperbolic functions are respectively: 


sin? 0 + cos? 0 = 1 


cosh? x — sinh? x = 1 


The three hyperbolic functions have the following definitions: 


. —e 
sinh x — 
2 
cosh x — са 
2 
sinh х e* — 1 
tanhx = = —— 
coshx ех +1 
and their reciprocals аге: 
1 2 
cosechx = — = 
sinh x ех – ех 
1 2 
sech х = = 
cosh х ех + ех 
1 Pu 
coth x — = e - i 
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Table 22.1 Hyperbolic function names 


Function | Reciprocal | Inverse function Inverse reciprocal 


sinh cosech arsinh arcsch 
cosh sech arcosh arsech 
tanh coth artanh arcoth 


Table 22.2 Rules for differentiating hyperbolic functions 


y dy/dx 
sinh x cosh x 
cosh x sinh x 
tanh x sech? x 
cosechx | —cosechx · coth x 
sech x — sech x - tanh x 
coth x — cosech? x 


Other useful identities include: 


sech?x = 1 — tanh? x 


cosech?x — coth? x — 1 


The coordinates of P and О in Fig. 22.15 are given by P(cos0, sin@) and 
Q(cosh x, sinh x). 

Table 22.1 shows the names of the three hyperbolic functions, their reciprocals 
and inverse forms. As these functions are based upon e* and е“, they are relatively 
easy to differentiate. 


22.10.1 Differentiating sinh, cosh and tanh 


Table 22.2 gives the rules for differentiating hyperbolic functions, and Table 22.3 for 
inverse hyperbolic functions. 

Table 22.4 gives the rules for integrating hyperbolic functions, and Table 22.5 for 
inverse hyperbolic functions. 


22.11 Higher Derivatives 


There are three parts to this section: The first part shows what happens when a function 
is repeatedly differentiated; the second shows how these higher derivatives resolve 
local minimum and maximum conditions; and the third part provides a physical 
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Table 22.3 Rules for differentiating inverse hyperbolic functions 


y dy/dx 
inh 1 
arsinh x —— 
V1+ x? 
1 
arcosh x 
х2-1 
1 
artanh x 
1 — x? 
h 1 
arcschx | —————— 
x^ 1-4 x2 
A 1 
arsechx | —————— 
хУ1-х2 
1 
arcoth x ——— 
x2—1 


Table 22.4 Rules for integrating hyperbolic functions 
жо | ff) dx 
sinh x cosh x + C 


cosh x sinh x + C 
sech?x | tanhx +C 


Table 22.5 Rules for integrating inverse hyperbolic functions 


fœ) f fe) dx 
1 
—— arsinh x + C 
Мт+х? 
1 
————— | arcoshx + С 
М2 —1 
1 
ТЕО artanh x + С 


interpretation for these derivatives. Let’s begin by finding the higher derivatives of 
simple polynomials. 


22.12 Higher Derivatives of a Polynomial 


We have previously seen that polynomials of the form: 


у = ах" + bx! + cx! ... 
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Fig. 22.16 Graph of 


у = 3x? + 2x? — 5x (blue) \ ХА J 
and its derivative Е 
2 = 9x7 4x —5 (green) A ` à 
2 1 ом. 7 154 
a Sh 
-4 
аге differentiated as follows: 
d 
c rax + sbx*-! + tex"! 
dx 


For example, given: 


then: 


which describes how fast y changes relative to x. 

Figure 22.16 shows the graph of у = 3х3 + 2x? — 5x and its derivative ay = 
9х? + 4x — 5, and we сап see that when x = —1 there is a local maximum, where 
the function reaches a value of 4, then begins a downward journey to 0, where the 
slope is —5. Similarly, when x ~ 0.55, there is a point where the function reaches 
a local minimum with a value of approximately —1.65. The slope is zero at both 
points, which is reflected in the graph of the derivative. 

Having differentiated the function once, there is nothing to prevent us differen- 
tiating a second time, but first we require a way to annotate the process, which is 
performed as follows. At a general level, let y be some function of x, then the first 


derivative is: d 


di^ 


The second derivative is found by differentiating the first derivative: 


and is written: 
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Similarly, the third derivative is: 
d? y 
dx? 
and the nth derivative: 
d" y 
ах" 


When a function is expressed as f(x), its derivative is written f'(x). The second 
derivative is written /”(х), and so on for higher derivatives. 
Returning to the original function, the first and second derivatives are: 


d 
oY = 9х2 44x —5 
dx 
а?у 


Figure 22.17 shows the original function and the first two derivatives. The graph of 
the first derivative shows the slope of the original function, whereas the graph of 
the second derivative shows the slope of the first derivative. These graphs help us 
identify a local maximum and minimum. By inspection of Fig. 22.17, when the first 
derivative equals zero, there is a local maximum or a local minimum. Algebraically, 
this is when: 


Fig. 22.17 Graph of 


у = 333 + 2x? — 5x (blue), \ » J 
its first derivative 4 
ду = 9x? + 4x — 5 (green) A BW 
and its second derivative x. 
pu № 
T3 = 18x + 4 (red) 
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d 
BT 50 
dx 
9х2 ++4x —5—0 


Solving this quadratic in x we have: 


_ —b+ Jb? — 4ac 
== 2а 
wherea = 9, b=4, с=—5: 
_ —4 = 716+ 180 
7 18 


ж=-1, х-0.5 


which confirms our earlier analysis. However, what we don’t know, without referring 
to the graphs, whether it is a minimum, or a maximum. 


22.43 Identifying a Local Maximum or Minimum 


Figure 22.18 shows a function containing a local maximum of 5 when x = — 1. 
Note that as the independent variable x, increases from —2 towards 0, the slope of 
the graph changes from positive to negative, passing through zero at x — — 1. This is 
shown in the function's first derivative, which is the straight line passing through the 
points (—2, 6), (—1, 0) and (0, —6). A natural consequence of these conditions 
implies that the slope of the first derivative must be negative: 


d?y 
— = —ve 
dx? 
Fig. 22.18 A function y 
containing a local maximum e^ 
(blue), and its first derivative es 
reen P 24 
(g ) 3g S 
Vi 2X 
А " d \ > 
Е | di 
X 
2 
БМ! 
-6 
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Fig. 22.19 A function 
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containing a local minimum 
(blue), and its first derivative 


(green) 


Figure 22.19 shows another function containing a local minimum of 5 when 


x = —]. Note that as the independent variable x, increases from —2 towards 0, 
the slope of the graph changes from negative to positive, passing through zero at 
x — —]. This is shown in the function's first derivative, which is the straight line 


passing through the points (—2, —6), (—1, 0) and (0, 6). A natural consequence 
of these conditions implies that the slope of the first derivative must be positive: 


а?у 


geo 


We сап now apply this observation to the original function у = 3x? + 2x? — 5x 


for the two values of x, x = —1, хә = 0.5: 
у = 3x? + 2x^ — 5x 


d 
T = 902 48-5 


х 
а?у 
= 18 х (—1) = –18 
= 18 х (0.5) = +10 
Which confirms that when х = —1 there is a local maximum, and when x = 0.5, 


there is а local minimum, as shown in Fig. 22.16. 


22.14 Partial Derivatives 


Up to this point, we have used functions with one independent variable, such as 
у = f(x). However, we must be able to compute derivatives of functions with more 
than one independent variable, such as у = f (и, у, w). The technique employed is 
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to assume that only one variable changes, whilst the other variables are held constant. 
This means that a function can possess several derivatives—one for each independent 
variable. Such derivatives are called partial derivatives and employ a new symbol 
д, which сап be read as ‘partial dee’. 

Given a function f (и, у, w), the three partial derivatives are defined as: 


Of |. f(uch,v,w) — flu, у, м) 
— lim 

ди h—0 h 

af _ im fiu, v 4-h,w) — f(u,v,w) 

ду 50 h 

9f : Хану w + №) — flu, у, м) 
= lim 

ди’ һ—>0 һ 


For example, a function for the volume of a cylinder is: 
V(r, h)=ar-h 


where r is the radius, and h is the height. Say we wish to compute the function’s 
partial derivative with respect to r. First, the partial derivative is written: 


ду 
or 
Second, we hold Л constant, whilst allowing г to change. This means that the function 


becomes: 
V(r, h) 2 kr? (22.5) 


where К = zh. Thus the partial derivative of (22.5) with respect to г is: 


aV 
— =2kr 
or 


= 2лһг 


Next, by holding r constant, and allowing h to change, we have: 


QV 
=_= пр? 
oh 


Sometimes, for purposes of clarification, the partial derivatives identify the constant 
variable(s): 
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Partial differentiation is subject to the same rules for ordinary differentiation—we 
just to have to remember which independent variable changes, and those held con- 
stant. As with ordinary derivatives, we can compute higher-order partial derivatives. 
For example, let’s find the second-order partial derivatives of f(u, v), given: 


flu, у) = u + 2u?v? — 4у? 


The first partial derivatives are: 


af = 4и? + би?у? 
ди 
af = 4иЗу — 12? 
ду 


апа the second-order partial derivatives are: 


92 

57 = 12u? + 12uv? 
u 

32 

5 = 4u? — 24v 
y 


In general, given f(u, v) = uv, then: 


Of _ 
ди Bi. 
of 
ду 


апа the second-order partial derivatives are: 


9? f 
ди? 
Sf 


ду — 


=0 


Similarly, given f(u, у) = u/v, then: 


of | 


ди у 
of и 
ду y»? 
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and the second-order partial derivatives are: 


92 
Ре 
ди? 
9? f _ 2u 
ду2 УЗ 
Finally, given f(u, у) = и”, then: 
of a y—1 
—— = yu 
ди 


whereas, 0f/dv requires some explaining. First, given: 


Ға, v) = и’ 
taking natural logs of both sides, we have: 


Іп f(u, у) = уи 


and 
jens 
Therefore: 
д 
af = е" и 
ду 
=u’ Inu 


The second-order partial derivatives are: 


92 

47 = y(v — 1)u”? 
u 

32 

47 =u" ln’ и 
y 


22.14.1 Visualising Partial Derivatives 
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Functions of the form y = f(x) are represented by a 2D graph, and the function’s 
derivative f'(x) represents the graph's slope at any point x. Functions of the form 
z= f(x, y) can be represented by a 3D surface, like the one shown in Fig. 22.20, 
which is z(x, y) — 2.5x? — 2.5y?. The two partial derivatives are: 


22.14 Partial Derivatives 493 


Fig. 22.20 Surface of 

z = 2.5х2 — 2.5y? using a 
right-handed axial system 
with a vertical z-axis 


Oz 

IAS Cd 
Ox " 
д 

з. ы 
ду 


where ШЕ is the slope of the surface in the x-direction, as shown іп Fig. 22.21, and 
i is the slope of the surface in the y-direction, as shown in Fig. 22.22. 
The second-order partial derivatives are: 


p 

axi = 5 = +уе 
922 

ay? = —3 = —ve 


2. . ТЕ $ аЙ А В А 25. . 
As as is positive, there is a local minimum in the x-direction, and as ay is negative, 


there is a local maximum in the y-direction, as confirmed by Figs. 22.21 and 22.22. 
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Fig. 22.21 z describes the 


slopes of these contour lines 


Fig. 22.22 az describes the 
у . 
slopes of these contour lines 
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22.14.2 Mixed Partial Derivatives 


We have seen that, given а function of the form f (и, у), the partial derivatives a and 
a provide the relative instantaneous changes in f and и, and f and v, respectively, 


whilst the second independent variable remains fixed. However, nothing prevents us 


from differentiating ar with respect to v, whilst keeping u constant: 


9 @ 
av x.) 


д2 f 
dvou 


which is also written as: 


and is a mixed partial derivative. For example, to find the mixed partial derivative 
of f, given: 
f(u, у) = изу“ 


we have: 3 
af = Зи2у* 
ди 
апа 5 
ШЕ gs 12и2у3 
дуди 


It should be no surprise that reversing the differentiation gives the same result: Let: 


f(u, у) = изу“ 


then: 3 
9f Ay? 
ду 

апа А 
ОРИ 1242? 
диду 


Generally, for continuous functions, we can write: 


92; A 92; 
диду дуди 
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22.15 Chain Rule 


Earlier, we came across the chain rule for computing the derivatives of functions of 


functions. For example, to compute the derivative of у = sin? x we substitute и = x?, 


then: 
yu 
y 
—— = cosu 
u 
= cos(x?) 
du. 
Next, we compute Tz: 
и = х? 
аи 
— = 2х 
ах 


апа ov is the product of the two derivatives using the chain rule: 


dy ау du 

dx du dx 
= cos(x?)2x 
= 2% cos(x?) 


But say we have a function where w is a function of two variables x and y, which in 
turn, are a function of u and v. Then we have: 


w= f(x, y) 
x=r(u, v) 


у = su, v) 


With such a scenario, we have the following partial derivatives: 


Ow ðw 
ax’ ду 
Ow ðw 
ди’ ду 
Ox Ox 
ди’ ду 
dy dy 


ди’ ду 


22.15 Chain Rule 


These are chained together as follows: 


dw dw Ox " ðw oy 
ди 9х ди ду ди 
ди dw Ox " Ow ду 
ðv ax ду dy ду 
For example, to find ow and oe given: 
и = f (2x +3y) 


х= г(и? + у?) 


у= =(и2 — у?) 


we have: 


Ox 
ди 
ду _ 
ðu 


2u, 


ди 
ду 
дх 


ду 


ду 
ду 


= 2у 


= —2у 


and plugging these into (22.6) and (22.7) we have: 


Ow | Ow Ox ду ду 

du Әх ди Әу ди 
-2х2и--3х2и 
= 10и 

Ow dw Ox ду ду 

dv Әх ду ду ду 
-2х2у--3 х (-2у) 
= —2v 

Thus, when и = 2 and v = 1: 
OW oon and ----2 
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(22.6) 


(22.7) 
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22.16 Total Derivative 


Given a function with three independent variables, such as w = f(x, y, t), where 
x = g(t) and y = h(t), there are three primary partial derivatives: 


dw | Ow ðw 
ax’ ду’ Ot 


which show the differential change of w with x, y and f respectively. There are also 
three derivatives: 


dx dy dt 
dt' dt' dt 
where а = |. The partial and ordinary derivatives can be combined to create the 


total derivative which is written: 


dw dwdx  Owdy д» 
dt дх а дуа at 


4” measures the instantaneous change of w relative to t, when all three independent 


dt 
variables change. For example, to find w, given: 


их? +xy +y +12 


х= 2t 
y=t-1 
we have: 
dx 
калкан, 
dt 
d 
wai 
dt 
ðw 
—— =2x+y=4+t-l1=5t-1 
Ox 
д 
3; = +352 =21+3( 1)2 = 302-4 +3 
у 
д 
ЕЕ ад 
ot 
dw dw ах dw dy д 


т 


(5t — 1)2 + (312 — 4t - 3) + 21 = 312 + 81 +1 
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and the total derivative equals: 


dw 2 
— = 3t^ + 81 + 1 
аі 


and when t = 1, dw/dt = 12. 


22.17 Power Series 


A power series is an infinite string of nomial (a single algebraic name) terms with 
increasing powers. A general form is written: 


oo 
nx 2 3 
ах = ао + ах + ax" + aax +... 
n=0 


where а, and x” are generally real quantities. And because each term is individually 
simple, they are relatively easy to differentiate and integrate. For example, given: 


y = ao + ax + ах? + ax? + ах® +... 


а 
ах? = a, + 2а›х + Зазх?  Aaax? +... 


An excellent example is found in the exponential function e*: 


In particular: 


l1. . = 1 1 
ны ыы ТЕН 


1 
ОЖ 


1 
it] 


1 
(373 


2 
In 1715, the English mathematician Brook Taylor (1685-1731) published Methods 
Incrementorum Directa et Inversa which contained a theorem concerning power 
series. Today, this is known as Taylor’s theorem, and the associated series: Taylor’s 
series. Lagrange recognised its importance and called it ‘the main foundation of 
differential calculus’. 

Taylor proposed that any reasonable function, such as sinx and cosx can be 
written as a power series: 


sinx = dg tayx + ajx? + a3x° + адх“ + asx’ + asx? Tee 
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To find the values of ао, a), a», etc., we proceed as follows. When x = 0, sin 0 = 0, 
which implies ао = 0. Therefore: 


sinx = ах + арх? + азх? + agx* + ах? + авхб +... (22.8) 
Differentiating (22.8) we get: 
ы ТТ + 2а›х + Зазх? + 4a4x? + 5asx^ + 6agx? +... 
dz 1 2 3 4 5 6 
When x = 0, соз 0 = 1, which implies a, = 1. Therefore: 
COS x = ay + 2ax + Зазх? + Aa4x? + 5asx^ + 6agx? +- (22.9) 


Differentiating (22.9) we get: 


d 
d. cosx = — sin x = 2a» + базх + 12а4х? + 20a5x? + 30agx* +... 
х 


When x = 0, — $100 = 0, which implies аҙ = 0. Therefore: 


— sin x = базх + 12a4x? + 20asx? + 30agx* +... (22.10) 
Differentiating (22.10) we get: 
d Р 2 3 
ax sin x) = — cos x = баз + 24a4x + 60asx^ + 120agx^ +--- 
х 


When х = 0, — cos 0 = —1, which implies аз = –{. Therefore: 
— cos x = баз + 24aax + 60asx? + 120agx? +... (22.11) 
Differentiating (22.11) we get: 
d | 2 
d COS х) = sin x = 24a4 + 120asx + 360ах + --- 


When x = 0, sin 0 = 0, which implies a4 = 0. Therefore: 


sinx = 120asx + 360agx? +... (22.12) 


Differentiating (22.12) we get: 
а. 
2 sin x = cos x = 120а5 + 1080agx + --- 
x 


When x = 0, cos x = 1, which implies a5 = 17. 
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We now have ао = 0,41 = 1, a2 = 0, аз = –{, ад = 0, and а5 = To which means 
that the original sin x function comprises only the odd powers of x, with alternating 


signs. This permits us to write: 


Conversely, the cos x function comprises only the even powers of x, with alternating 
signs. This permits us to write: 


It is clear that sin x and cos x are closely related to e*: 


1 x? x? x4 x? хб x! x8 x? 


x X 
жа т + Ня + 


oes хэ x x! y? 

ОЕ ТОТА a at 
x? x4 хб x8 

cosx = 1 + doxes 


TD ix! x? d x^ id x6 dx x9 іу) 

C vom np umo Er bow 
254 x? x^ x98 x8 ix! ix? іх? dx? іх? 
ctu acte tug to Pap ue ctae car uar 
= 1 х? х* хб x8 . x! x? x? x? x? 
cur Pape ee epe Е 7 


= cosx + isinx 
which is Euler's trigonometric formula. 
22.18 Worked Examples 


22.18.1 Antiderivative 1 


Given 2* = 1, find y. 
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Solution 
Integrating: 


y= f tax 


=x+C 


22.18.2 Antiderivative 2 


Given 2 = 6x? + 10x, find y. 


Solution 
Integrating: 


y= | в + 10% dx 


= 2х3 + 502 + С 


22.18.3 Differentiating Sums of Functions 


Differentiate у = 2х6 + sinx + cos x. 


Solution 
Differentiating: 


у = 2х6 + sin x + cos x 
dy 


= 12x" + cos x — sin x 
dx 


22.18.4 Differentiating a Function Product 


2 


Differentiate y = x^ sin x. 


Solution 
Differentiating: 
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22.18 Worked Examples 


Fig. 22.23 Graph of y 
y= x? sin x (blue) and its | ^ 
derivative, e = x2cosx4- | 

2x sin x (green) 
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іп 0 
2 
АН 
du 
— = 2х 
ах 
v=sinx 
dv 
— = cosx 
dx 
dy dv du 


v 
dx dx dx 


= x? cosx + 2x sin х 


dy 


2 py 
> dx 


Figure 22.23 shows a graph of у = x^sin x and its derivative 


2x sin x. 


22.18.5 Differentiating an Implicit Function 


Differentiate x? — у? + 4x = бу. 


Solution 
Differentiating: 


x? — y? +4х = бу 


dy dy 
2x — 2y — = 6— 
К Уах Ы ах 
Rearranging the terms, we have: 
y dy 


— 
RY 


= x? cos x + 
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= © 6 42y) 
^ dx 4 
dy 2x+4 
dx 6--2у 


If we have to find the slope of x? — y? + 4x = бу at the point (4, 3), then we simply 
substitute x = 4 and y = 3 іп ay to obtain the answer 1. 


22.18.6 Differentiating a General Implicit Function 


Differentiate x” + y" = a". 


Solution 
Differentiating: 
x" + у" — а" 
dy 
nx" 4 n=1 77 =0 
dx 
dy | nx" 
dx пу"! 
x" 
m у"! 


22.18.7 Local Maximum or Minimum 


Given у = —3x? + 9x, find the local minimum and maximum for у. 


Solution 
The first derivative is: 


y = —3x3 + 9x 
d 
Е = 9x2 +9 
and second derivative: 
а?у 
1х2 = -18х 


as shown in Fig. 22.24. For a local maximum or minimum, the first derivative equals 
Zero: 


—9x7+9=0 


к- 


which implies that x = +1. 
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Fig. 22.24 Graph of 

y = —3x3 + 9x (blue), its 
first derivative, 

2 = —9x? + 9 (green) and 
its second derivative 


y = —18x (red) 


The sign of the second derivative determines whether there is a local minimum 
or maximum: 


d? 
ZY 18x 
dx? 
= —18 x (-1) = +уе 
= —18 x (+1) = —ve 
therefore, when x = — 1 there is a local minimum, and when x = +1 there is a local 


maximum, as confirmed by Fig. 22.24. 


22.18.8 Partial Derivatives 


Find the second-order partial derivatives of f, given: 
f(u, v) = sin(4u) - cos(5v) 


Solution 
The first partial derivatives are: 


f(u, v) = sin(4u) - cos(5v) 
af 
ди 
af 
av 


= 4cos(4u) - cos(5v) 


= —5sin(4u) - sin(5v) 
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and the second-order partial derivatives are: 


2 
Ef = —16 sin(4u) - cos(5v) 
ди? 

2 
ба = —25sin(4u) · cos(5v) 
ду? 


22.18.9 Mixed Partial Derivative 1 
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Given the formula for the volume of a cylinder is V(r, h) = лг?һ, where ғ and л 
are the cylinder’s radius and height respectively, compute the mixed partial derivative. 


Solution 
V(r, h)=ar-h 
ӘУ 
— =2лһг 
or 
ə? V 
= 2лг 
dhor 
or 
V(r, h) = zr?h 
ӘУ 
UE 
ə? V 
=2лг 
oroh 


22.18.10 Mixed Partial Derivative 2 


Given f(u, v) = sin(4u) - cos(3v), compute the mixed partial derivative. 


Solution 
af = 4cos(4u) - cos(3v) 
ди 
2 
or = — 12 соз(4и) - sin(3v) 
дуди 


or 
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ar = —3sin(4u) - sin(3v) 
ду 
2 
a. = —12cos(4u) - sin(3v) 
диду 


22.18.11 Total Derivative 


Given: 


w=x?+xuyty+t 
x = 2t 


y=t-1 


compute the total derivative w, 


Solution 
dx 
— =2 
dt 
d 
a | 
dt 
ди 
LL =2х+у= 4+1 1 = 51 – 1 
дх 
д 
ЕЕ 
ду 
9 
S en 
ot 


dw ðw dx д» dy ди 
dt əx dt dy dt at 
= (5t—1)2+ 2t+1)+1 


and the total derivative equals: 
dw 
— = 121 
dt 


Chapter 23 A) 
Calculus: Integration geu 


23.1 Introduction 


In this chapter we develop the idea that integration is the inverse of differentiation, 
and explore the standard algebraic strategies for integrating functions, where the 
derivative is unknown; these include simple algebraic manipulation, trigonometric 
identities, integration by parts, integration by substitution and integration using par- 
tial fractions. The chapter concludes with a number of worked examples. 


23.2 Indefinite Integral 


In the previous chapter we have seen that given a simple function, such as: 


y = sinx + 23 
dy 
— = cosx 
dx 


and the constant term 23 disappears. Inverting the process, we have: 


y= | cosx ax 


= sinx + C 


ЕС dx 


is known as an indefinite integral; and as we don’t know whether the original 
function contains a constant term, a constant C has to be included. Its value remains 


An integral of the form: 
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J. Vince, Foundation Mathematics for Computer Science, 
https://doi.org/10.1007/978-3-031-66549-3 23 


510 23 Calculus: Integration 


undetermined unless we are told something about the original function. In this exam- 
ple, if we are told that when x = 1/2, у = 24, then: 


24 = зшл/2 + C 
=1+C 
С = 23 


23.3 Integration Techniques 


23.3.1 Continuous Functions 


Functions come in all sorts of shapes and sizes, which is why we have to be very 
careful before they are differentiated or integrated. If a function contains any form of 
discontinuity, then it cannot be differentiated or integrated. For example, the square- 
wave function shown in Fig. 23.1 cannot be differentiated as it contains discontinu- 
ities. Consequently, to be very precise, we identify an interval [a, b], over which a 
function is analysed, and stipulate that it must be continuous over this interval. For 
example, a and b define the upper and lower bounds of the interval such that: 


a<x<b 
then we can say that for f(x) to be continuous: 


lim for +h) = f(x) 


Fig. 23.1 A discontinuous 
square-wave function 


Sx 


xy 
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Even this needs further clarification as must not take x outside the permitted 
interval. So, from now on, we assume that all functions are continuous and can be 
integrated without fear of singularities. 


23.3.2 Difficult Functions 


There are many difficult functions that cannot be differentiated and represented 
by a finite collection of elementary functions. For example, the derivative f'(x) — 
(sin x)/x does not exist as a function, which precludes the possibility of its integra- 
tion. Figure 23.2 shows this function, and even though it is continuous, its derivative 
and integral can only be approximated. Similarly, the derivative f'(x) = ./x sin x 
does not exist in a precise form, which precludes the possibility of finding a precise 
integral. Figure 23.3 shows this continuous function. So now let's examine how most 
functions have to be rearranged to secure their integration. 


Fig. 23.2 Graph of 


y = (sinx)/x 7A 
2n 0 2n 
ЕЯ J^ P d > 
Ха” е. x 


Fig. 23.3 Graph of 
y = ух sinx p 


Se 


512 23 Calculus: Integration 


23.4 Trigonometric Identities 


Sometimes it is possible to simplify the integrand by substituting a trigonomet- 
ric identity. For example, let's evaluate f sin? x dx, f cos? x dx, f tan? x dx and 
f sin(3x) - cos x dx. 

The identity sin? x = (1 — cos(2x)) converts sin? x into a double-angle form: 


1 
ЕЕ ах- 5] 1 — соѕ(2х) dx 


1 1 
5] dx — 5 / e» ах 


1 1 
5" == 4 ѕ1п (2х) + С 


Figure 23.4 shows the graphs of у = sin? x and у = ix - 1 sin(2x). 
The identity cos? х = +(cos(2x) + 1) converts cos? x into a double-angle form: 


1 
БЕ ах- 2 | e» + тах 


1 1 
5 | e» Ах + 5] dx 


1 1 
4 sin(2x) + 5% + C 


Figure 23.5 shows the graphs of y= cos? x and y = 1 вїп(2х)+»х. 
The identity sec? x = 1 + tan? x, permits us to write: 


Гаа = [ts 
= [хас [14х 


=tanx-x+C 


Figure 23.6 shows the graphs of у = tan? x and у = tan x — x. 


Fig. 23.4 The graphs of 


y 
y= sin? x (green) and A 
y= 1х - І sin(2x) (blue) P. 
2 ЕЕ 22 
WP 1 e 
4 - 4 А7 0 1 2 3 4 5 % 
P d 
и 1 
m 
E 
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Fig. 23.5 The graphs of 
y= cos? x (green) and 
y= i sin(2x) + ix (blue) 


Fig. 23.6 The graphs of 
у = tan? x (green) and 
у = tan x — x (blue) 


ГҮ 
M 


Finally, to evaluate f sin(3x) - cos x dx we use the identity: 
2sina -cosb = sin(a + Б) + sin(a — b) 
which converts the integrand's product into the sum and difference of two angles: 
| 1. А 
sin(3x) - cos x = 2 Enx) + sin(2x)) 

1 

ri sin(3x) - cos x dx = 2 | sin(4x) + sin(2x) dx 

1 | 1 А 

= 2 | sin(4x) dx + 5 l sin(2x) dx 


1 1 
a cos(4x) — а cos(2x) + С 


Figure23.7 shows the graphs of y = sin(3x) - cos x andy = -4 cos(4x) — 1 cos(2x). 
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Fig. 23.7 Тһе graphs of 

y = sin(3x) - cos x (green) 
and y = -i cos(4x)— 

i cos(2x) (blue) 


Sse 


Vd Lm i 2 wax 


23.4.1 Exponent Notation 


Radicals are best replaced by their equivalent exponent notation. For example, to 


evaluate: 
2 
Ух ах 


we proceed as follows: 


The constant 2 is moved outside the integral, and the integrand is converted into an 
exponent form: 


Figure 23.8 shows the graphs of у = 2/4/x and у = 8х1 /3. 


23.4.2 Completing the Square 


Where possible, see if an integrand can be simplified by completing the square. For 


example, to evaluate: 
1 
] 2 
x? — 4х +8 
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Fig. 23.8 The graphs of 


515 


y — 2/4/x (green) and ^ 
y = 8х1 /3 (blue) 4 | 
3 
2 
| c esl 
1 
0 1 | 3 4 5 € 8 X 
m | 


we proceed as follows: 
We have already seen that: 


1 
| 6 = маши + С 
1--х2 


and it’s not too difficult to prove that: 


1 1 х 
———— dx = —arctan| – | +C 
а? + x? а а 


Therefore, if we can manipulate an integrand into this form, then the integral will 


reduce to an arctan result. The following needs no manipulation: 


| 1 d 1 | x n 
x = — arctan | — 
44+ x? 2 2 


However, the original integrand has x? — 4x +8 as the denominator, which is 


resolved by completing the square: 
x? 4x -8-24-4 (x —2y 


Therefore: 


1 1 
= эле рине 
[=== И ] zr Г 


1 i х-2 +С 
= — arctan | ——— 
2 2 


Figure 23.9 shows the graphs of y = 1/(x? — 4x + 8) and y =( arctan =? ) /2. 
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Fig. 23.9 The graphs of y 
y =1/@? — 4x +8) oa’ 
(green) and E 
х-2 
y = (arctan 52) /2 (blue) m 
3 -2 1 0 1 3 4 5 6 > 
-0.2 
-0.4 
Fig. 23.10 The graphs of Ж | сй = 
у = 1/(х2 + 6x + 10) ay | 
(green) and 
y = arctan(x + 3) (blue) 
a | ieee 
7 6 5 4 -2 1 0 1 2 x 
-1 
Lae 


To evaluate: 1 
— d 
J х2 + 6бх+10°^ 


we factorise the denominator: 


1 1 
| imt oe © 
x? + бх + 10 12 + (x +3)? 
= arctan(x + 3) + C 


Figure 23.10 shows the graphs of у = 1/(x? + 6x + 10) апа у = arctan(x + 3). 


23.4.3 The Integrand Contains a Derivative 


f(x) 
d. 
J foe 


An integral of the form: 


23.4 Trigonometric Identities 


is relatively easy to integrate. For example, let’s evaluate: 


arctan x 
1--х2 
Knowing that: 
d 
dx arctan x — 1x2 
let и = arctan x, then: 
du 1 
dx 1+x2 


and 


arctan x 
dx = f udu 
1+ x? 


z (arctan х)? +C 


Figure 23.11 shows the graphs of у = arctan x/(1 + x?) and у = 1 (arctan х)?. 


An integral of the form: 
iO. 
f(x) 


is also relatively easy to integrate. For example, let’s evaluate: 


cos x 
- ах 
sin x 


Knowing that: 
а. 
— sin x = cosx 
dx 


let u = sin x, then: 


du 
— =cosx 
dx 
and 
cos X 1 
| - а= | а 
sin x и 
= ш|и|+ С 
= ln | sin x| + C 


Figure 23.12 shows the graphs of y = cos x/ sin x and y = In | sin x|. 
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Fig. 23.11 Тһе graphs of 
у = arctan x/(1 + x?) 
(green) and 

y= 1 (arctan x)? (blue) 


ЗА 


| 


Fig. 23.12 The graphs of y 
у = cos x/ sin x (green) and | ^ | | 
y = In|sin x| (blue) 2 
1 
а ГУ 0 401 Va 
ul M \/ 
|| | | \ ( 
| i | ІІ 


23.4.4 Converting the Integrand into a Series of Fractions 


Integration is often made easier by converting an integrand into a series of frac- 
tions. For example, to integrate: 


[ 4x3 + x? — 8+ 12x cos x 
dx 
4x 


we divide the numerator by 4x: 


4х3 2—8 12 2 
f хЗ+х 4 X COSS io ЖЕКЕ 
4x 4 " 


Ото 
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Fig. 23.13 The graphs of 
у = (4х3 +x? — 8+ 

12x cos x)/4x (green) and 
у = x3/34x7/8— 
21n|x| + 3 sin x (blue) 


S 


| 
= 


Figure 23.13 shows the graphs of y = (4x? + x? — 8 + 12x созх)/4х and у = 
x? /3 + x?/8 — 2In|x|+3sinx. 
23.4.5 Integration by Parts 


Integration by parts is based upon the rule for differentiating function products 
where: 


therefore: 


ww | аах f wdx 
Гао f wax 


Thus, if an integrand contains a product of two functions, we can attempt to integrate 
it by parts. For example, let’s evaluate: 


ЕСЕ ах 


which rearranged, gives: 


In this case, we try the following: 
u=x and v = ѕіпх 


therefore: 
и —1 and v=C;—cosx 
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Fig. 23.14 The graphs of y 
у = x sin x (green) and A Д. 
у = —x cos x + sin x (blue) ЖЕ 


2 
Integrating by parts: 


Ди ах = ии — f wax 


| mae COS X) [€ cos x) (1) dx 


= Сх —xcosx — Cix + sinx + C 


—x cos x + sinx + C 


Figure 23.14 shows the graphs of y = x sin x and y = —x cos x + sin x. 
Note the problems that arise if we make the wrong substitution: 


u=sinx and v =x 


therefore: 4 


, x 
и = соѕх and Шеке 


Integrating by parts: 


Гао f wax 
x? x? 
ЕСЕ а (+0) -[(®+<) cos x dx 


which requires to be integrated by parts, and is even more difficult, which suggests 
the substitution was not useful. 
Now let’s evaluate: 
f x? cosx dx 
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In this case, we try the following: 


и=х? and у’ = cosx 


therefore: 
и'=2х and v= sinx + Ci 


Integrating by parts: 


ах = ии аа 


fe cosx dx = x° (sin x + Су) — 2 [етл + С)х ах 
=? пя + Сы? = 20, | хах -2 | xsinx dx 
x2 
= x° sinx + Cix? — 2C, ( ужо) -2 f xsinx as 
= x? sinx — с-2 | xsinx dx 
At this point we come across f x sin x dx, which we have already solved: 


ps cos x dx = x? sin x — Сз — 2(—x cos x + sin x + C4) 


= x? sin x — Сз + 2x cos x — 2sinx — С; 


= x?sinx + 2х cos x —2sinx + C 


2 2 


Figure 23.15 shows the graphs of y = x^ cos x and y = x^ sin x + 2x cos x — 2sin x. 


Fig. 23.15 The graphs of 


y= x? cos x (green) and 


y = х2 sin x + 2x cos x 


2 sin x (blue) 2 


IAL 


| —— 
>< 
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Fig. 23.16 The graphs of 


y 
y = x ln x (green) and A 
y= ix? Inx — ix? (blue) 6 
4 
Ж 
/ 
2 / 
1 о = 2 3 4 X 
-2 


Now let’s evaluate: 


БЕ ах 


u=Inx and v =x 


In this case, we try the following: 
therefore: 


Integrating by parts: 


f imm f wax 
1, 1\1 
xInx dx = -x° lnx x dx 
2 2 x 
E. 2| >| d 
=” nx 5 хах 


1 1 
= na = QU +С 


Figure 23.16 shows the graphs of y = x In x and y = ix? Inx — ix’. 


Finally, let’s evaluate: 
f V1+x2 dx 
Although this integrand does not look as though it can be integrated by parts, if we 


rewrite it as: 
| V 14 x?(1) dx 


then we can use the technique. 
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Let: 


= ү1+х2 and v = 1 


therefore: 


Integrating by parts: 


Геж-ш- fou as 


ШЕН |== 


v1 ++ a 
Now we simplify the right-hand integrand: 
[ 9e = лам Ее 
1 js au 
—x/l4x?— |= + f- 
М1 + r2 


оо 


Now we have the original integrand on the right-hand side, therefore: 


2 | Ve as m UE + asinis С) 
1 1 
Га = 5ХУ 1 +x? + garsinhx + С 


Figure 23.17 shows the graphs of у = У1 + x? and y = $x/1 + x? + jarsinhx. 


23.4.6 Integration by Substitution 


Integration by substitution is based upon the chain rule for differentiating a function 
of a function, which states that у is a function of и, which in turn is a function of 


x, then: 
dy dy du 


dx du dx 
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Fig. 23.17 The graphs of 


У, 
y = У1+ x? (green) and 4 
y= ixl +x? + 5arsinhx 
(blue) 2 
1 ^ 
> 
4 3 2 P 1 2 8 4х 
4 
Ж 
-2 
-3 


For example, let's evaluate: 


[ x^v x? dx 
This is easily solved by rewriting the integrand: 


[мах = | ETE 


24 4c 
= -xX 
9 


However, introducing a constant term within the square-root requires integration by 
substitution. For example: 


fe хз + Тах 
First, we let и = x? + 1, then: 


du 2 du 
2-08 or dx= z7 


Substituting u and dx in the integrand gives: 
d 
fever ldx= pes 
x 
1 


J ғ 
=> | и? аи 
3 

12, 
о 

2 aod 
= 56 р +C 


Figure 23.18 shows the graphs of у = x?./x3 + 1 and y = $c + 1)2. 
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Fig. 23.18 The graphs of 


y 
y = x? / x? + 1 (green) and ^ 
y = $6? +1)? (blue) 2 
1 
> 
1 0 1 2 x 
zi 
Fig. 23.19 The graphs of Mp 
y = 25іп х - cos x (green) 
and y = sin? x (blue) f 2 


EM 1 б 
т 


Now let's evaluate: 
[sins -cosx dx 


Integrating by substitution we let и = sin x, then: 


du du 
— —cosx or dx = 
dx cos X 


Substituting и and dx in the integrand gives: 


| du 
очах сохах =2 | ucosx 


COS X 
=2 | udu 


= и? +С 
= sin? x +C 


Figure 23.19 shows the graphs of y = 2 sin x · cos x and y = sin? x. 
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23.4.7 Partial Fractions 


Integration by partial fractions is used when an integrand’s denominator contains 
a product that can be split into two fractions. For example, it should be possible to 


convert: 
3x +4 


(х + DG + 2) 


into: 
dx + i d 
— dx — — dx 
x+1 x+2 
which individually, are easy to integrate. Let’s compute A and B: 


3x +4 A B 
G*DG42 xt1 x42 
3x +4 = A(x -2) + B(x + D 
= Ax -2A + Bx + B 


Equating constants and terms in x: 


4=2A+B (23.1) 
3=A+B (23.2) 


Subtracting (23.2) from (23.1), gives A = Тапа В = 2. Therefore: 
3x +4 1 2 
Ге ue — — dx 
(x + 0) (х + 2) х+1 x+2 
= ln(x + 1) - 21n(x 4-2) +С 


Figure 23.20 shows the graphs of y = (3x + 4)/((x + D)(x + 2)) and y = ln(x + 1)4- 


2In(x + 2). 
Now let's evaluate: 
[ 5х —7 
— — — dx 
(x — 1)(х — 2) 
Integrating by partial fractions: 
5x —7 A B 
= + 
(х—1)(х—2) х-і x-2 


5x — 7 = А(х-2)--В(х-і1) 
= Ах + Вх – 2А – В 
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Fig. 23.20 The graphs of 
y = (3х + 4)/((х +1) | 1-3 


(x + 2)) (green) and 
у= (х +1) - 21In(x 4-2) 


(blue) 
| — T 


Fig. 23.21 The graphs of y 
у = (5х – D/(x — П) ^ 
(x — 2)) (green) and y — 
2In(x — 1) + 3In(x — 2) 
(blue) ? 


Equating constants and terms in x: 


—7=—2А-В (23.3) 
5=A+B (23.4) 


Subtracting (23.3) from (23.4), gives A = 2 and B = 3. Therefore: 
3x +4 2 3 
— dx = | ——d — d 
x—Dx-2) ^ [+] 5% 
= 2In(x — 1) + 3ln(x 2) +С 
Figure 23.21 shows the graphs of y=(5x — 7)/((x — 1) (х —2))andy 221n(x — 1)+ 


3In(x — 2). 


23.5 Summary 


Integration appears to be a collection of strategies that convert a function into a form 
recognised by a strategy. However, life is never as simple as this, and one is always 
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confronted with a function that defies a direct solution. So, as a last resort, maintain 
contact with someone who eats and sleeps calculus! 


23.6 Worked Examples 


23.6.1 Integrating a Function Containing its Own Derivative 


Evaluate: . 
sin x 
А ах 
COS X 
Solution 
Knowing that: 
d 
— cos x = — зах 
dx 
let u — cos x, then: 
du | 
— = —sinx 
dx 
du — —sinx dx 
and 
sin x 1 
[ — dx = | —(—1)4и 
COS X u 
= – 1 |и| +С 
= — Ш | cos x| + C 


= ln | cos x|! + C 
= ln | sec x| + C 


Figure 23.22 shows the graphs of y = sin x/ cos x and y = In | sec x|. 


23.6.2 Dividing an Integral into Several Integrals 


Evaluate: 


| 2 sin x + cos x + sec x 
dx 


COS X 
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Fig. 23.22 The graphs of y 
у = sin x/ cos x (green) and || А | | | | 
\ 


у = In| sec x| (blue) i : 7j 


Y 


Solution 
Divide the numerator by cos x: 


2si 
| э ма as INIRE dx=2 | шахах+ f 1ах+ | ea 
cos x 


= 210 | ѕесх| + x + tan x + C 


Figure 23.23 shows the graphs of y = (25іпх + cosx + ѕесх)/ соѕх and 
y = 2ln | sec x| + x + tan x. 


23.6.3 Integrating by Parts 1 


[ x cos x dx 
Fig. 23.23 The graphs of 


у = (2sinx + cos x + | УА 
sec х)/ созх (green) and Е | / J 
Ға 


Evaluate: 


y = 2In|secx|+x+tanx | 


(blue) | 
_ | | KA 


yx 
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Fig. 23.24 The graphs of 
у = x cos x (green) and 
y = x sin x + cos x (blue) 


Solution 
In this case, we try the following: 


f 
u=x and v = cosx 


where: 
u =1 and v= sinx + С, 


Integrating by parts: 


До ах = мо = f wax 


IET dx = xGinx €) — | бак «cot dx 


= xsinx + Cix + cosx — Cix +С 


= xsinx + cosx + C 


Figure 23.24 shows the graphs of y = x cos x and y = x sin x + cos x. 


23.6.4 Integrating by Parts 2 


[ x? sinx dx 
Solution 


In this case, we try the following: 


Evaluate: 


и= х? and v = sinx 
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Fig. 23.25 The graphs of 


у 
у = x? sinx (green) and A 
y = —х2 cosx + 2x sinx + 
2 cos x (blue) 


Ер 

ie № 

>. 
а 


where: 
и! = 2х and о = – cosx + С 


Integrating by parts: 


Гао f wax 


le sin x dx = x?(— cos x + СІ) -2 | = созх coo dx 
= созх + Сы? —2С\ | xdx+2 | xcosx dx 
x 
= —x?cosx + Сух? — 2C, (Z+) +2 fons 
= css — C3 +2 | xcosx dx 


At this point we come across f x cos x dx, which we have already solved: 


[г sinx dx = —x?cosx — C3 + 2(x sin x + cos x + Са) 
= —x? cos x — Сз + 2x sin x + 2cosx + Cs 


= —x?cosx + 2x sin x +2cosx+C 


Figure 23.25 shows the graphs of у = x?sinx and y = —х? cosx + 2x sin x + 
2 cos x. 
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Fig. 23.26 The graphs of 


y = 28%) sin x - cos x гү УА 
(green) апа у = -j есо5(2х) , 
(blue) | 

1 

N п 0 pem = > 
Ж “ P Ға N 
ы NOS 
| V 


23.6.5 Integrating by Substitution 1 


Evaluate: 
f 26595509 сір x . cosx dx 


Solution 
Integrating by substitution, let u — cos(2x), then: 


du . du 
— = —2sin(2x) or dx = —————— 
dx 2sin(2x) 


Substituting a double-angle identity, и and du: 


| а 
| 2659802х) sinx-cosx dx = — 1 e" sin(2x) ———— 
2 sin(2x) 


1 
cos(2x) 
-- +С 
2° 


Figure 23.26 shows the graphs of у = 269909 sin x - cos x and y = — 565992, 


23.6.6 Integrating by Substitution 2 


COS X 
[ MN 0877 
(1 + sinx)’ 


Evaluate: 
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Fig. 23.27 Тһе graphs of 
y = cosx/(1 + sin x)? А 
(green) and 

y = —1/2(1 + sin x)? (blue) 


Solution 
Integrating by substitution, let и = 1 + sin x, then: 


du du 
— —cosx or dx = 
dx cos X 
| oS COS X E x= | cosx du 
(14 Gs i u? cosx 
= |= du 
1 
= и? +C 


1 ЖҰЛА. 
= =, trema) +С 
: +С 
2(1 + sin x)? 


Figure 23.27 shows the graphs of у = cos x/(1 + sin x)? and y = —1/2(1 + sin х)2. 


23.6.7 Integrating by Substitution 3 


[ sin(2x) dx 
Solution 


Integrating by substitution, let и = 2x, then: 


Evaluate: 
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Fig. 23.28 The graphs of 
y = sin(2x) (green) and | 
y= -j cos(2x) (blue) 


du 2 4 и 
— z= or = — 
dx 5 
; 1 р 
К ах = ET du 
: +С 
= —-cos 
2 и 


1 
ix cos(2x) + C 


Figure 23.28 shows the graphs of y = sin(2x) and y = -i cos(2x). 


23.6.8 Integrating with Partial Fractions 


Evaluate: 
| 6x? + 5x —2 
—— dx 
x3 + x2 — 2x 
Solution 
Integrating by partial fractions: 
6x7 +5х-2 _ a B 4 С 
хХ3+х2—2х x x+2 x-1 
6x? + 5x — 2 = A(x + 2)(х — 1) + Bx(x — 1) + Cx(x +2) 
= Ax? + Ax — 2A + Bx? — Bx + Сх? + 2Сх 
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Fig. 23.29 The graphs of y 

y = (6x? + 5х — 2)/ (x3 4 \ 
x? — 2x) (green) and 
у= ах + 2In(x 4-2)4 


3 In(x — 1) (blue) e 


Equating constants, terms in x and х2: 


m = —2A (23.5) 
5=A-B+2C (23.6) 
6=A+B+C (23.7) 


Manipulating (23.5), (23.6) and (23.7): A = 1, B = 2 and C = 3, therefore: 
6x? + 5x — 
2 di d —— — dx —— 14 
IE ХРА cd -/: и: is * ic " 
=Inx мыз — 1) 


Figure 23.29 shows the graphs of у = (6x? + 5x — 2)/(х? + x? — 2x) and y= 
In x + 21n(x + 2) +3 ШС — 1). 


Chapter 24 A) 
Area nds 


24.1 Introduction 


This chapter describes how integration computes the area under a graph, and how 
single and double integration are used to compute surface areas and regions bounded 
by functions. Also in this chapter, we come across Jacobians, which are used to 
convert an integral from one coordinate system to another. To start, let's examine the 
area under a graph. 


24.2 Area Under a Graph 


The ability to calculate the area under a graph is one of the most important discov- 
eries of integral calculus. Prior to calculus, area was computed by dividing a zone 
into very small strips and summing the individual areas. The accuracy of the result is 
improved simply by making the strips smaller and smaller, taking the result towards 
some limiting value. In this section, we discover how integral calculus provides a 
way to compute the area between a function's graph and the x- and y-axis. 


24.3 Calculating Areas 


Before considering the relationship between area and integration, let's see how area 
is calculated using functions and simple geometry. 
Figure 24.1 shows the graph of y — 1, where the area A of the shaded zone is: 


A=x, x>0 
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Fig. 24.1 Area of the y 
shaded zone is A = x 


Fig. 24.2 Area of the 
shaded zone is A = x? 


For example, when x = 4, A = 4, and when x = 10, A = 10. An interesting obser- 
vation is that the original function is the derivative of A: 
dA _ 
dx 


Figure 24.2 shows the graph of y = 2x. The area A of the shaded triangle is: 


1 
A= рае - height 


1 
= =x -2x 
2 


= x 


Thus, when x = 4, A = 16. Once again, the original function is the derivative of A: 


ЧА _ 


Ах = 
Ах n 
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Fig. 24.3 Graph of 


y=VP—x2 


which is no coincidence. 
Finally, Fig. 24.3 shows a circle where x? + y? = r?, and the curve of the first 
quadrant is described by the function: 


y-vr?—x?, хє [0, г] 


The total area of the shaded zones is the sum of the two parts A, and A». To simplify 
the calculations the function is defined in terms of the angle 0, such that: 


x-—rsinO 
and 
y-rcos0 
Therefore: 
2 
0 
P d 
2 
1 . rn. 
Ал = 20 cos 0)(r sin 0) = 4 sin(20) 
А=А, + А 


Ns sin(20) 
=F (6+ с ) 


То show that the total area is related to the function’s derivative, let’s differentiate A 
with respect to 0: 
«А L (1+ cos(20)) = r? cos? 8 
dE cos = r^ cos 
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But we want the derivative ЧА, which requires the chain rule: 
dA Е dA а0 
dx 40 dx 
where: 
ш =r СО80 
40 
ог 
а _ | 


dx  rcosÓ 
therefore: 
ЧА _ r? cos? 0 


— = =r cos = у 
ах ғ cos 


which is the equation for the quadrant. 
Hopefully, these three examples provide strong evidence that the derivative of the 
function for the area under a graph, equals the graph’s function: 


dA 
Ет f(x) 
which implies that: 


A= [ feas 


Now let's prove this observation using Fig. 24.4, which shows a continuous func- 
tion y — f (x). Next, we define a function A(x) to represent the area under the graph 
over the interval [a, x]. 6A is the area increment between x and x + óx, and: 


8A ж f (x) - dx 


Fig. 24.4 Relationship y 
between y = f(x) and A(x) 


a x х+дх X 
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We can also reason that: 

ôA = A(x + ôx) — A(x) & f(x) - bx 
and the derivative ал is the limiting condition: 


dA . A(x + 6x) — A(x) . f(x)-d6x 
— = lim = lim 
dx бх->0 bx 6х—0 bx 


= f(x) 


thus: 


whose antiderivative is: 


Ac = f fc dx 


The function A (x) computes the area over the interval x € (а, b] and is represented 
by: 
b 
A(x) = [ f(x) ах 
which is called the integral or definite integral. 


Let's assume that A(b) is the area under the graph of f(x) for x є [0, 5], as 
shown in Fig. 24.5, and is written: 


b 
A(b) = І f(x) ах 
0 


Similarly, let A(a) be the area under the graph of f(x) for x € [0, a], as shown in 
Fig. 24.6, and is written: 


Fig. 24.5 A(b) is the area y 
under the graph y = f(x) 
for x € [0, b] 
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Fig. 24.6 А(а) is the area y 
under the graph y = f(x) 
for x € [0, a] 


Fig. 24.7 A(b) — A(a) is 
the area under the graph 
у = f(x) for x € [a, В] 


A(a) = | f(x) ах 
0 
Figure 24.7 shows that the area of the shaded zone for x е (а, b] is calculated 
by: 
A= A(b) — A(a) 


which is written: 


b a 
a= feas | f(x) dx 
0 0 


and is contracted to: 


b 
A =| f(x)dx (24.1) 


The fundamental theorem of calculus states that the definite integral: 


b 
І f(x) ах = F(b) — F(a) 
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where: 
FG) = | го) а, х=а 
Е = | го) ах, х= Б 


In order to compute the area beneath a graph of f (x) for x Е (а, b], we first integrate 
the graph’s function: 


F(x) = [roa 
and then calculate the area, which is the difference: 
А = F(b) — F(a) 


To illustrate how (24.1) is used in the context of the earlier three examples, let’s 
calculate the area over the interval x € [1, 4] for y = 1, as shown in Fig. 24.8. We 


begin with: 
4 
А = f ldx 
1 


Next, we integrate the function, and transfer the interval bounds employing the sub- 


, we have: 


4 
stitution symbol | , or square brackets | | . Using 
1 


1 1 


Fig. 24.8 Area under the y 
graph is Т 14х 
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4 
or using | | , we have: 
1 


I will continue with square brackets. 
Now let's calculate the area under the graph of y — 2x for x € [1, 4], as shown 
in Fig. 24.9. We begin with: 
4 
А- | 2х ах 
1 


Next, we integrate the function and evaluate the area: 


4 

a=| * | 

1 

= 16-1 
= 15 


Finally, let's calculate the area under the graph of у = Jr? — x2, which is the 
equation for a circle, for x € [0, r], as shown in Fig. 24.10. We begin with: 


A= І Мг? — x? dx (24.2) 
0 


Fig. 24.9 Area under the 
graph is dr 2x dx 
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Fig. 24.10 Area under the 


. ғ 
graph is Јо Vr? — x? dx jedes 


(ғ51п0, rcos@) 


Unfortunately, (24.2) contains a function of a function, which is resolved by substi- 
tuting another independent variable. In this case, the geometry of the circle suggests: 


x=rsing 
therefore: 
Vr? — x? = rcosó 
and " 
d6 =r cosÓ (24.3) 


However, changing the independent variable requires changing the interval for the 
integral. In this case, changing x е [0, r] into 0 € [0;, 65]: 

When x = 0, rsin; = 0, therefore 0; = 0. 

When x =r, rsin0; = ғ, therefore 0; = 7/2. 

Thus, the new interval is 6 е [0, 2/2]. 

Finally, the dx in (24.2) has to be changed into 40, which using (24.3) makes: 


dx — r cos0 d0 


Now we are in a position to rewrite the original integral using 0 as the independent 
variable: 
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A= ри (ғ соз 6)(r cos 0) 40 
0 


л 


ер cos? Ө 40 
0 


І “71-ы cos(20) 40 
0 


1 2 
Ө + = sin(20) | 
2 0 


which makes the area of a full circle x 7?. 


24.4 Positive and Negative Areas 


Area in the real world is always regarded as a positive quantity-no matter how it is 
measured. In mathematics, however, area is often a signed quantity, and is determined 
by the direction of vertices. In calculus, areas above the x-axis are positive, whilst 
areas below the x-axis are negative. This can be illustrated by computing the area of 
the positive and negative parts of a sine wave. 

Figure 24.11 shows a sketch of a sine wave over one cycle, where the area 
above the x-axis is labelled А}, and the area below the x-axis is labelled А». 
These areas are computed as follows: 


Fig. 24.11 The two areas 
associated with a sine wave 
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Al =] sin x dx 
0 


л 
-| — COS X | 
0 


=1+1 
E 


However, A» gives a negative result: 


2x 
Аз =f sin x dx 
T 


This means that the area is zero over the bounds 0 to 27r: 


2x 
А» =] sin x dx 
0 


Il 
| 1 
| 
Q 
о 
т 
= 
1 
a 


І 
н 
+ 


Consequently, one must be very careful using this technique for functions that are 
negative in the interval under investigation. Figure 24.12 shows the sine wave sin x 
for x е (0, л] and its accumulated area. 


Fig. 24.12 The accumulated 
area of the sine wave sin x 
for x € [0, л] 


548 24 Area 


24.5 Area Between Two Functions 


Figure 24.13 shows the graphs of у = x? and у = x?, with two areas labelled A, and 
A». A, is the area between two functions for x € [—1, 0] and A» is the area trapped 
between the two graphs for x € [0, 1]. These areas are calculated very easily: in the 
case of A, we sum the individual areas under the two graphs, remembering to reverse 
the sign for the area associated with y = х2. For Аҙ we subtract the individual areas 
under the two graphs. 


A» = 


| 
5 Б 
ы 
N 
a 
5 
| 
z 
= 
чә 
a 
ы 


i d 
7304 
PE. 
“ор 


Note, that іп both cases the calculation is the same, which implies that when we 
employ: 


Fig. 24.13 Two areas 
between y = x? andy = х 


3 
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Fig. 24.14 The area 


between у = sin x and Ау Я 
маш : y=sinx 
A 
(0.5236, 0.5) у-0.5 (2.618, 0.5) 
х 
0 1 2 3 % 


b 
A= f Lf (x) — 20] ах 


A is always the area trapped between f(x) and g(x) for x є (а, b]. 

Let’s take another example, by computing the area A between y = sin x and the 
line y = 0.5, as shown in Fig. 24.14. The horizontal line intersects the sine curve at 
x = 30° and x = 150°, marked in radians as 0.5236 and 2.618 respectively. 


150° 51/6 
д= | sinx dx - | 0.5 dx 
30° 1/6 


150° | 57/6 
=| -cosx | zi 
x» 2 л/6 


24.6 Areas with the y-Axis 


So far we have only calculated areas between a function and the x-axis. So let’s 
compute the area between a function and the y-axis. Figure 24.15 shows the function 
y= x? for x € (0, 4], where A, is the area between the curve and the x-axis, and А» 
is the area between the curve and y-axis. The sum A; + Аҙ must equal 4 x 16 = 64, 
which is a useful control. Let's compute Ат: 
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Fig. 24.15 The areas АУ 
between the x-axis and the 16 
y-axis 
a) 
A, y-x 
A, 
x 
0 4 zr 


which means that A; = 42.6. 
To compute А» we construct an integral relative to dy with a corresponding 
1 
interval. If у = x? then x = y? for y € (0, 16]: 


16 | 
“=f y? dy 
0 

| 2 j | 
= =y? 
3 0 
= 264 
73 
- 42.6 


24.7 Area with Parametric Functions 


When working with functions of the form у = f(x), the area under its curve and the 
x-axis for x € (а, b] is: 


b 
a= | f(x)dx 
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However, if the curve has a parametric form where: 
x= f(t) and у= (0 


then we can derive an equivalent integral as follows. 
First: We need to establish equivalent limits t € (о, В], such that: 


a= f(a) and b= f,(B) 
Second: Any point on the curve has corresponding Cartesian and parametric coor- 


dinates: 
x and f(t) 


y= f(x) and fy() 


Third: 
х = f.) 
dx = (ба 
b 
A | f(x)dx 
"в 
al AOFM) dt 
therefore: р 
А =] LORD dt (24.4) 


Let’s apply (24.4) using the parametric equations for a circle: 


X 


M 


—r cost 


r sint 


as shown in Fig. 24.16. Remember that the Cartesian interval is x € (а, b] left to 
right, and the polar interval t € (о, В], must also be left to right, which is why 
x — —r cost. Therefore: 
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Fig. 24.16 The parametric 
functions for a circle 


xY 


fit = тїшї 
Ба) =rsint 
B 
A= AOFM dt 


л 
= І r sint -r sin(t) dt 
0 


r2 л 
= >| 1 — cos(2t) dt 
2 Jo 


п 


0 


227 1. 2 
= 5+ 2 Sint n| 
2 


which makes the area of a full circle x 7?. 


24.8 The Riemann Sum 


Bernhard Riemann made major contributions to various areas of mathematics, includ- 
ing integral calculus, where his name is associated with a formal method for summing 
areas and volumes. Through the Riemann Sum, Riemann provides an elegant and 
consistent notation for describing single, double and triple integrals when calculat- 
ing area and volume. Let's see how the Riemann sum explains why the area under a 
curve is the function's integral. 
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Fig. 24.17 Тһе graph of у 
function f (x) over the 
interval [a, b] 


у= Дох) 


т № hs ha hs hs m hs 


Ас Ах Ах Ах Ах Ax Ax Б 
X0 Xi X2 X3 X4 XS Х X7 XB 


Figure 24.17 shows a function f (x) divided into eight equal sub-intervals where: 


and 
а= Xo < хр < X2 <... < X7 < Xg =b 


In order to compute the area under the curve over the interval (а, b], the interval 
is divided into some large number of sub-intervals. In this case, eight, which is not 
very large, but convenient to illustrate. Each sub-interval becomes a rectangle with a 
common width Ax and a different height. The area of the first rectangular sub-interval 
shown shaded, can be calculated in various ways. We can take the left-most height 
хо and form the product xo Ax, or we can take the right-most height хі and form 
the product хі Ax. On the other hand, we could take the mean of the two heights 
(xo + x1)/2 and form the product (xo + хі) Ax/2. A solution that shows по bias 
towards either left, right or centre, is to let x be anywhere in a specific sub-interval 
Ах, then the area of the rectangle associated with the sub-intervalis f (x7) Ах;, and 
the sum of the rectangular areas is given by: 


8 
A= у, #(хў)Ах; 


і-і 


Dividing the interval into eight equal sub-intervals will not generate a very accurate 
result for the area under the graph. But increasing it to eight-thousand or eight- 
million, will take us towards some limiting value. Rather than specify some specific 
large number, it is common practice to employ п, and let п tend towards infinity, 
which is written: 


A=) fapAx (24.5) 


i=l 
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The right-hand side of (24.5) is called a Riemann sum, of which there are many. For 
the above description, I have assumed that the sub-intervals are equal, which is not 
a necessary requirement. 

If the number of sub-intervals is и, then: 


Ax = 


and the definite integral is defined as: 


b n 
f f(x) dx = lim M ft) ДА 
а noo m 


24.9 Surface of Revolution 


A surface of revolution is a popular 3D modelling technique used in computer 
graphics for creating objects such as wine glasses and vases, where a contour is 
rotated about an axis. Integral Calculus provides a way to compute the area of such 


surfaces using: 
b ау\? 
S= an | ЖОО + (2) dx (24.6) 
å x 


where y = f(x) and is differentiable over the interval x € [a, b]. 

To derive (24.6), consider the scenario shown in Fig. 24.18, where points P and 
Q are on a continuous curve generated by the function y = f(x). The curve over 
the interval x € [a, b] is to be rotated 360° about the x-axis. 

The coordinates of P and Q are (xi, yj) and (xi+1, yj+1) respectively, Лх; = 
Хал — хг, and As; approximate to the arc length between P and О: 


Fig. 24.18 The geometry to 
create a surface of revolution 
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As; © Ү1--(/ (с)? Ax; 


where с is some x е (а, Ь] satisfying Lagrange’s mean-value theorem. 
To compute the area AS; swept out by the line segment РО when rotated 360° 
about the x-axis, we use the mean radius r;: 


Viti + Yi 
SS 
2 
such that: 
AS; 7 217; Asi 


йе (=) М Gt Ax, 


As Ax; — 0, у: © y; © f (c), therefore: 


AS; © 2n f (OA 1+ CPC)! Ax; 


Consequently, the total area swept by the arc about the x-axis is: 


S = lim у "2nf (O41 + (ОУ Ах 


i=1 
b 2 
S= 2x | Ро) 1+ (2) ах (24.7) 
ü ү ах 


Similarly, the total area swept by the arc about the y-axis is: 


b ах\? 
$=2л | fo) (2) dy (24.8) 


Let’s use (24.7) and (24.8) with various functions. 


24.9.1 Surface Area of a Cylinder 


To compute the surface area of a cylinder we employ the geometry shown in 
Fig. 24.19, where a straight horizontal line is rotated 360? about the x-axis. The 
function is simply у = r, and x € [0, A]. As y = r, dy/dx = 0, and: 
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Fig. 24.19 Surface area of a 
cylinder 


b 2 
5-2 | f(x) (2) dx 
а ү ах 
һ 
=2nr | 1 ах 
0 
һ 
= an | 
0 


= 2лгй 


which is correct. 


24.9.2 Surface Area of a Right Cone 


To compute the surface area of a right cone we employ the function y = rx/h, 
where r is the cone's radius and h its height, as shown in Fig. 24.20. Therefore: 


Fig. 24.20 The geometry 
used to compute the surface 
area of a right cone 
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7 

= -х 
у= 
dy r 
dx h 


| 
һә 


hy г? 
л A i 1 + m dx 
Qnr (^ [h2 +P 
—— X dx 
h Jo h2 
h 
= — | xyk +r dx 
0 


2 h 
= | x dx 
0 


| 2115 1,5 
“ов 2 
= ЛҮ 


which is correct. 
Reversing the line’s slope to y = r(1 — x/h) as shown in Fig. 24.21 we have: 


(е) 


Fig. 24.21 Surface area of a 
right cone 
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Fig. 24.22 The surface of a 
right cone created by 
sweeping a line about the 
X-axis 


X 
h Jo h 


Figure 24.22 shows a view of the swept conical surface. 


24.9.3 Surface Area of a Sphere 


The surface area of a sphere is 5 = 4л r?, and is derived as follows: 
Figure 24.23 shows a unit semi-circle and Fig. 24.24 shows the surface of revolu- 
tion when this is swept 360? about the x-axis. The equation of a circle is x? + y? = ғ? 


for x € [—r, r] therefore: 
Ха) = у= ут а? 
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Fig. 24.23 A unit 
semi-circle 


Fig. 24.24 The surface of 
revolution formed by 
sweeping a semi-circle 
through 360° 


To find f’(x), let: 


559 


u-—r^—x 
du 
— 2-2 
dx 4 
у= уи 
dy 1 р 1 1 
— = –и = = 
du 2 2/и 2412 х2 
dy ау du _ 1 (2x) = —х 
dx du ах 2/p2— x2 dm Us m 
which is substituted in (24.6): 
b 2 
d 
5=2л | ғо) 1+ (2) ах 
А ах 
г | ын 2 
=2л] yr? — x? [14 (==) dx 
—r Jr? — x? 
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r 2 
=2л | yr-x? (ao ЭШ 
= \ r2—x 


5 r 

=2n | yr? — x2. dx 
-r м2 — x? 

= эл, | 1 ах 


== заң, 


= 2лг2г 


= 4лг? 


24.9.4 Surface Area of a Paraboloid 


To compute the surface area of a paraboloid we rotate the parabola function y = x 


about the y-axis, as shown in Fig. 24.25. 


2 


yur 
х= у 
dx 1 


ау 2 


Fig. 24.25 А parabola to be y^ 
rotated about the y-axis 


МУ 
b | 2 
5-2 | fo) i (S) dy 


24 Area 


2 


gm 
у 
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== | vs +05) ® 


1 
1 
=2 1+—d 
|5 Tug 6) 


== | а Ей. 
=== | ба 


=x Ау + 14у 
0 


Let u = 4y + 1, therefore, du/dy = 4, ог dy = du/4. The limits for и are | апа 5: 


5 = — Ju du 


-1(у®-У®) 
E(B) 
А 5.33 


Figure 24.26 shows a similar parabolic surface. 


Fig. 24.26 A parabolic 
surface 


562 24 Area 


24.10 Surface Area Using Parametric Functions 


The standard equation to compute surface area is: 


b dy 2 
5 = әл | Ро) 1+ (2) ах (24.9) 
a dx 


where the curve represented by f (x) is rotated about the x-axis. In order to convert 
(24.9) to accept the following parametric equations: 


х = а) 
JOS AO 


First, we need to establish equivalent limits (о, В] for t, such that: 
а = (а) and b= f,(B) 
Second, any point оп the curve has corresponding Cartesian and parametric coordi- 


nates: 
x and f(t) 


y=f() and fy(t) 


Third, we compute dy/dx from the individual derivatives dx / 4! and dy/dt: 


dx dt dx 


dy dy dt 
which means that (24.9) can be written as: 


P dy dt 2 
5-2 | seo (2 dx 


B dxdt)? dydty? 
кей | кір | (dx т m STAR 


В ахү  (dy\?d 
= 20 | se (=) + (%) = dx 
B 2 2 
seas f s (=) +(2) a ало) 


For example, to create а unit-sphere from the parametric equations for a semi-circle 
we have: 
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x = f(t) = —cost 
у = fy(t) =sint 

Ах . 

— = sinf 

dt 

d 

2y = cost 

dt 


B dx\* dy 
s=2 | AO (2) +(2 


л. 
=2л І sin tv sin? t + cos? t dt 


0 


л 
= an | sint dt 
0 


= —2л | өз | 
0 


= 2л(1+1) 


= 4л 


which is correct. 
To rotate about the y-axis (24.10) becomes: 


2 
E 


B 2 2 
s=2 (so (8) + (8) a 
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Up to this point we have only employed single integrals to compute area, but just as 
it is possible to differentiate a function several times, it is also possible to integrate 


a function several times. For example, to integrate: 


z= f, у) = х?у 


over the interval x € [0, 3], then we write: 


3 3 
| f(x, у) ах =} xy dx 
0 0 


But say we now want to integrate 9y over the interval y € [0, 2], we write: 
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2 2 
[ovay=9 | y dy 
0 0 
1 2 
=9| -у? 
ЕШ 


= 18 


These two individual steps сап be combined in the form of a double integral: 


2 73 
І І x?y dx dy 
о Jo 


where the inner integral is evaluated first, followed by the outer integral: 


3 


2 (3 2r1 
[ао | Ea ydy 
о Jo о L3 0 
2 
=°/ уау 
0 


2 p2 2 
3 l 2 3 
3àxy dx dy = 3 =x y dy 
0 Ji o L2 1 


Area 
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24.12 Jacobians 


In spite of a relatively short life, the German mathematician Carl Gustav Jacob Jacobi 
(1804—1851) made a significant contribution to mathematics in the areas of elliptic 
functions, number theory, differential equations and in particular, the Jacobian matrix 
and determinant. 

The Jacobian matrix is used in equations of differentials when changing vari- 
ables, and its determinant, the Jacobian determinant, provides a scaling factor 
in multiple integrals when changing the independent variable. I will provide three 
applications of the determinant, showing its use in one, two and three dimensions. 


24.12.1 ID Jacobian 


In order to integrate some integrals, we often have to substitute a new variable. For 
example, to integrate: 


4 
f V2x + 1 4х 
1 


it is convenient to substitute и = 2x + 1, where du/dx = 2 ordx/du = 1/2, calcu- 
late new limits for и: i.e. 3 and 9, and integrate with respect to и: 


4 9 dx 
| verias | „лаи 
1 3 du 
1 9 
-3/ Ми du 
2 J3 


1 9 
= zS ul? du 


9 
E. | zu | 
23. |, 
_ 1! (93/2 — 33/2) 


А —(27 — 5.2) 
А 7.3 


The factor 1/2 is introduced because x changes half as fast аз и. This scaling factor is 
known as a Jacobian, and is the derivative dx /du. We can also write it as дх /du, even 
though there is only one variable, as the partial notation keeps the Jacobians consistent 
as we increase the number of dimensions. Furthermore, we are only interested in the 
magnitude of the Jacobian, not its sign. 
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The scaling factor could also be another function. For example, to integrate: 


2 x ч 
І ааа” 


it is convenient to substitute и = x? + 2, where du/dx = 2x or dx/du = 1/(2x), 
calculate new limits for и: i.e. 2 and 6, and integrate with respect to и: 


f x dcs 6 x as 
s rD AE du 


1 6 x 
Ox 
pu 
- 5] Td 
6 
= 5 | «а 
М Ее ші 
ЕЕ 
1/1 1 
=5(-3+5) 
xj 
6 


In this case, the scaling factor is 1/(2х), which is the corresponding Jacobian, how- 
ever, this time its value is a function of x. 


24.12.2 2D Jacobian 


When defining double integrals using Cartesian coordinates, one normally ends up 
with something like: 


b 
| Е 


where dx dy is regarded as the area of an infinitesimally small rectangle, and is often 
represented by dA. But if we move from Cartesian coordinates to polar coordinates 
and work with functions of the form g(p, @), there is a temptation to substitute 
g(e, 0) for f(x, y) and (ар d0) for (dx dy), which is incorrect. The reason why, 
is that the differential area of a rectangular region in Cartesian coordinates does 
not equal the differential area of a corresponding region in polar coordinates. The 
Jacobian determinant provides us with the adjustment necessary to carry out this 
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substitution, which in this case 15 р, and (dx dy) is replaced by (р ар d0). I will 
describe a general solution to this problem, which is found on various Internet 
websites (http://mathforum.org/dr.math/). 

Figure 24.27 shows an infinitesimally small rectangle defined by the points 
C,C2C3C, in Cartesian coordinates. The vertical broken lines identify lines of con- 
stant x, and the horizontal broken lines identify lines of constant y. The rectangle’s 
width and height are dx and dy, respectively, which makes dA = dx dy. Similarly, 
Fig. 24.28 shows an infinitesimally small rectangle defined by the points P, P; Рз P4 
in another coordinate system. The vertical broken lines identify lines of constant и, 
and the horizontal broken lines identify lines of constant v. The rectangle’s width 
and height are du and dv, respectively. 

We now create two single-valued functions mapping parametric coordinates 
(u, v) into Cartesian coordinates (x, y): 


x = f(u, v) and y= gu, v) 


where for every (x, y) there is a unique (и, у). There are also two single-valued 
functions mapping Cartesian coordinates (x, y) into parametric coordinates (и, у): 


u = F(x, y) апа у= G(x, y) 


Fig. 24.27 Тһе rectangle y 
C1 C35C3C4 in Cartesian : constant xi 
space | ; 


Fig. 24.28 The rectangle (0 
Ру P) Рз P4 in parametric | constant и 
ao Py Ps 


шаласы E NE ee 
| i constant v 
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Fig. 24.29 The parametric ab 
points Ру P» Рз Рд in v nS 
Cartesian space co™ “ 
` Pu 
` — №------ жебе 
I \ constant v 


For example, given: 


2 


u+v u— v 
х= | апа у = 
2 2 


Next, we take the points in иу-ѕрасе and map them into their corresponding Cartesian 
points as shown in Fig. 24.29. The resulting shape depends entirely upon the nature 
of the mapping functions f(u, v) and g(u, v); however, we anticipate that they are 
curved in some way and bounded by contours of constant и and у. 

If the area of this differential region equals the Cartesian rectangle dx dy, then 
dx dy can be replaced by du dv. If not, we must compensate for any stretching or 
contraction. The problem therefore, is to compute the area of this curvilinear rectangle 
Pı P2 P3P4 in Fig. 24.29 and compare it to the area of the rectangle C? C2C3C4 in 
Fig. 24.27. This is resolved by assuming that when this rectangle is infinitesimally 
small, curves can be approximated by lines, and the area of the triangle РІ P» P4 is 
half the area of the required region. The area of the triangle is easily computed using 
the determinant: 


и=х? + у? and v=x 


then: 


1 111 
= | Хі X2 XA 
У! У2 У4 


where (хі, yi), (х2, yz) and (x4, уд) are the triangle’s vertices taken in anticlockwise 
sequence. Reversing the sequence, reverses the sign, which is why the absolute value 
is added at the end of the proof. However, if we assume that the area of the curvilinear 
region is twice the area of the triangle, then: 


111 
Area of (Р; Po Ps P4) = dA, = | X1 X2 х4 (24.11) 
У! У2 У4 
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The next stage is to derive a function relating the differentials dx and dy with 
du and ау, so that the triangle’s coordinates can be determined. These functions аге 
simply the total differentials for f and g: 


x= flu, v) 
y = gu, v) 
Ox Ox 
dx = —d —d 
x 3 u+ a y 
dy dy 
dy = —d —d 
y ди ae ду 


As with many mathematical solutions we сап save ourselves a lot of work by 
making a simple assumption, which in this case is that the coordinates of P, are 
(хі, yı), and the coordinates of P» and Р; are of the form (x; + dx, у + dy). 

Starting with P) with coordinates (x2, y2), then: 


у = у tdy 
д 
х = х + ж + СЕЛ 
и ду 
y dy 
= —d —d 
y2=y + du и + àv 


but as P; and P lie on a contour where v is constant, dv = 0, which means that: 


д 
хә = X1 + у 
ди 


д 
SJ 7? du 
ди 


Next, P4 with coordinates (x4, уд), then: 


x4 = xı + dx 


у = у + dy 
Ox Ox 

x4 = м + з + У 
д д 

уд = yı + С dy + % ау 
ди ду 


but as P; and P4 Пе оп a contour where и is constant, du = 0, which means that: 


Ox 
Ха = х + —dv 
ду 
ду 


а 
ras 


Уу = у + 
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We now plug the coordinates for Ри, P; and Р; into (24.11): 


1 1 1 
à дх g à 5х 2 
Xi xy + аи xi 
аА = |! 717 ди т 


ду 


ду 
а —d 
ди w ср ду қ 


yı yt 
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Rather than expand the determinant, let’s simplify it by subtracting column 1 from 


columns 2 and 3: 


1 0 0 
дх Ox 
dA = | сты 3,2" 
dy dy 
—du —d 
л ди * ду қ 
which becomes: ax Ox 
—du --ау 
dA ди ду 
д д 
2» dü 9 dy 
ди ду 


The determinant now contains the common term du dv, which is taken outside: 


Ox Ox 
ди ду 
аА! = dud 
1 ay ay иау 
ди ду 
Finally, we write this as: 
_ 9, y) 


dA, = du dv = |J| du dv 


д(и, v) 


where J is the Jacobian determinant: 


Ox Ox 
_|du ду 
= ду ду 
ди ду 


Therefore, for the region R, we can write: 


I F(x, »dxdy- | | F (f(u, у), gu, v)) |J| du dv 
R(x,y) R(u,v) 
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Let’s evaluate J for converting Cartesian to polar coordinates, where: 


x = рсоѕ0 
y=psind 
therefore: 
Ox 
—=cosdé, — = —psinÜ, — = 5100, —- = pcos0 
др д 


cos —psin@ 
sind ocos0 


= p cos? 0 + p sin? 0 
= р 


therefore, dx dy is replaced by р dp 40. 


24.12.3 3D Jacobian 


The Jacobian determinant generalises to higher dimensions, and in three dimensions 
becomes: 


Ox Ox Ox 
ди ду dw 
J= 2 D m (24.12) 
ди ду dw 
Oz Oz OZ 
ди ду dw 


and is used in with triple integrals for calculating volumes. For example, it is possible 
to show how a triple integral using spherical coordinates is converted into Cartesian 
coordinates using the appropriate Jacobian. For the moment, let’s evaluate the Jaco- 
bian determinant. Figure 24.30 shows the convention used for converting the point 
(x, y, 2) into spherical polar coordinates (о, Ф, 0). From Fig. 24.30 we see that: 


x = psing-cosé 
y = psing- 510 0 


z = рсовф 


the partial derivatives are: 


572 24 Area 


Fig. 24.30 Spherical polar 


coordinates 
Ox ig 0 Ox " 0 Ox ТН 
— =sing@-cosé, — = рсоѕф · с050, — = —psing: sin 
Эр эф” ө c 
д д д 
7 = sin $ - 5100, т = 0с05ф · sing, = = p sin $ - cos 0 
д д д 
Z coso, Z -= —psin, Z o0 
до дф 00 


Substituting these partials in (24.12): 


sin ф -cos@ p cos - cos0 —psing - sind 
J = | 510 ф - sin росоѕф -sinf рѕіпф · соѕ0 
cos ф —psing 0 


which expands to: 

det = p? cos? Фф. cos? 0 · sing + p? sin? Фф. sin? 0 + p? sin? Фф. cos? Ө + p? sing - sin? 0 - cos? ф 
= (0? sin? ф-- p? sing - cos? $) (sin? 0 + cos? ө) 
= p? sing (sin? ф-- cos? $) 


= p? sing 


Normally, we take the absolute value of the Jacobian determinant, but in this case, 
ф [0, л], and р? sin ф is always positive. Thus o? sin ф d$ 40 replaces dx dy dz 
in the appropriate integral. 

When using cylindrical coordinates, where: 


х= 0с05ф, y=psingd, 2-2 
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the Jacobian is p: 


Ox Ox Ox 
ди ду ди 
218% дУ дУ 
ди ду ди 
Oz Oz (Oz 
ди ду ди 
cos@ —psing 0 
= |510ф рсоѕф 0 
0 0 1 
= p cos? ф + p sin? ф 
= р 
Thus the first three Jacobians аге: 
Ox Ox Ox 
Ox дх ди ду dw 
дх ди ðv dy ду ду 
J = es J = , J — c 
: ди 2 ду ду à ди ду dw 
ди ду az üz de 
ди ду ðw 
which are often compressed to: 
дх a(x, y) д(х, у, 2) 
Л= ==, h= = 
ди д(и, v) д(и, у, м) 


24.13 Double Integrals for Calculating Area 


I will now illustrate how double integrals are used for calculating area, and in the 
next chapter, show how they are also used for calculating volume. To begin, look 
what happens when we integrate f(x, у) = 1 for x є [a, b], and y є [c, d]: 


d pb d pb 
І І f(x, »dxdy- | || 1 dx dy 
е а И а 7 
pa 


= (b — а)(4 — с) 
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Fig. 24.31 The projection of 
z= f(x, y) on the xy-plane 


The result is the product of the x- and y-intervals, which is the region A formed 
by a 3D surface projected onto the x y-plane, as shown in Fig. 24.31. The actual area 
of the surface created by z = f(x, y) bounded by the points РІ, P», Ру and Р; is 


given by: 
d b 2 2 
д д 
r= | [ ІНЕ (2) dx dy (24.13) 
o dä Ox ду 
Let’s show how (24.13) is used to compute area. The first example is simple апа 
is shown in Fig. 24.32, where z = f(x, у) = у. The intervals are x е [0, 2] and 


y € [0, 1]. By inspection, the area equals 24/2. Calculating the partial derivatives, 
we have: 


ША | апа ү 
дх ду 


therefore, (24.13) becomes: 
1 p2 
is | Ү1--02--12 dx ау 
о Jo 


- [| [ааа 
А» 
= [14 


= || 
= 24/2 


1 


0 
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Fig. 24.32 Part of the 
surface z = y 


Fig. 24.33 Part of the 
surface z = 4x + 2y 


The second example is shown in Fig. 24.33, where z = f(x, у) = 4x + 2y. The 
intervals are x € [0, 1] and y € [0, 1]. Calculating the partial derivatives, we have: 


therefore, (24.13) becomes: 
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1 1 
к-| | /+ё+2 ахау 
0 0 


=v f [acts 
af [e 


1 
=v | 1 ау 
0 


1 


10 
= «21 


0 


We can also calculate the area of the surface z = 4x + 2у contained within a 
specific region on the x y-plane as follows. For example, say the region is defined by: 


х?+ у? =1 
as shown in Fig. 24.34, we calculate the area as follows. 


To begin, we use polar coordinates instead of Cartesian coordinates, incorporating 
the vital Jacobian, and rewrite (24.13) as: 


z/2 pl ә 2 9 2 
R= І І j: + (2) ЕЗ (2) p dp 0 (24.14) 
0 0 Ox ду 
The inner integral integrates for p є (0, 1], and the outer integral integrates for 
0 € [0, л/21. Using the same equations, we have: 


Fig. 24.34 The graph of 
z = 4х + 2y intersecting the 
cylinder defined by 


x? + у? = 1 оп ће xy-plane 
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z/2 pl 9 2 9 2 
к-| ЕШЕ «(3 p dp d6 
о Jo ax dy 
л/2 pl 
=f | Vis +2 odo do 
0 0 
л/2 1 
- va | І р ао а0 
0 0 


1 


Е ут | С | _ 


л/2 
= vaf 140 
0 


л/2 


10 


_ М21л 
— 2 
А 7.2 


0 


For a third example, Fig. 24.35 shows part of a cone z = 4,/x? + y? intersecting 
a cylinder defined by x? + y? = 1 on the xy-plane. Let's calculate the area of the 
cone contained within the cylindrical region for р є [0, 1], and є [0, 2/2]. 

The partial derivatives are: 


д2 4х д 4у 


== de шиш 
дх / x? + y? S ду / x? + y? 


Fig. 24.35 The graph of 


z = 4A x? + y? intersecting 


the cylinder defined by 
х2 + y? = 1 on the xy-plane 


578 


therefore, using (24.14) we have: 
л/2 1 a 2 a 2 
AEG 
0 0 дх ду 
л/2 pl 4x 2 4y . 
= | [ 1+ (Ss ) F ( ) p dp d0 
0 0 Ух? + у? х2 + у? 
is 16x? 16y? 
= / [ "m ~ + p dp do 
хе + у? T Ty 


2/2 
= мт | f p dp dé 
0 0 


л/2 1 
= viv | B 40 
0 0 


л/2 
=v7 | 140 
0 


л/2 


10 


_ М17л 
77-2 
А 6.48 


0 


24 Агеа 


The above examples have been carefully chosen so that the radical within the 
integrand reduces to some numerical value. Unfortunately, this is not always the 


case, and integration has to involve software or numerical methods. 


24.14 Summary 


In this chapter we have derived formulae to compute the surface area of contours 
rotated about the x- and y-axis. The important formulae are repeated below. 
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24.14.1 Summary of Formulae 
Rotate about the x-axis 


b | 2 
5-2 | fx) 1+(2) dx 
a dx 


Rotate about the y-axis 


b 2 
5-2 | roni (8) dy 


If the function is described parametrically with x = f,(t) and y = fy(t) where 
t € [a, В], then: 


Rotate about the x-axis 
f dx\? (ауү 
5- әл | se (=) + (%) dt 
Rotate about the y-axis 
d dx\”  (dy\* 
s= 2л | (A) + (%) dt 


Double integrals for calculating the area of surfaces described by functions of the 
form z = f(x, y), then: 


Cartesian coordinates 


r= [f 


Spherical polar coordinates 


r= [ [үї+(®) 


2 ay 2 
+(#) dx dy 
dy 


2 


Oz А 
+ (S) p dp d0 
dy 
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The first three Jacobian determinants 


Ox Ox Ox 
ax Ox ди ду Ow 
дх ди ду ду ду ду 
J =: J = i J к= | Ее с, „шы 
1 ди í dy dy á ðu dv ðw 
ðu ðv az az д2 
ди ду Ow 
which are often written as: 

д a(x, a(x, у, 
ық 42809939. 4. PUN S 
ди д(и, у) д(и, v, w) 


Reference 


http://mathforum.org/dr.math/ 


Chapter 25 A) 
Volume сыны 


25.1 Introduction 


This chapter introduces four techniques for calculating the volume of various geo- 
metric objects. Two techniques are associated with solids of revolution, where an 
object is cut into flat slices or concentric cylindrical shells and summed over the 
object’s extent using a single integral. The third technique employs two integrals 
where the first computes the area of a slice through a volume, and the second sums 
these areas over the object’s extent. The fourth technique employs three integrals to 
sum the volume of an object. We start with the slicing technique. 


25.2 Solid of Revolution: Disks 


In Chap. 24 we saw that the area of a swept surface is calculated using: 


b 2 
5-2 | rei (4) dx 
5 dx 


Now let's show that the contained volume is given by: 


b 
Zu f (foo)? dx 


Figure 25.1 shows a contour described by y = f(x) rotated about the x-axis 
creating a solid of revolution. If we imagine this object cut into a series of thin 
slices, then the entire volume is the sum of the volumes of the individual slices. 
However, if we cut a real solid of revolution into a collection of slices, it is highly 
likely that each slice forms a right conical frustum, where the diameter of one side 
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Fig. 25.1 Dividing a volume У 
of revolution into small disks Р(х, y) у= }(х) 


ы” 


differs slightly from the other side. Therefore, our numerical strategy assumes that 
the slices are infinitesimally thin, and are thin disks with a volume equal to zr? Ax. 
Figure 25.1 shows a point P(x;, yj) on the contour touching a disk with radius f (xj) 
and thickness Ax. Therefore, the volume of the disk is: 


Wess) Ax 


Dividing the contour into n such disks, and letting n tend towards infinity, the entire 
volume is given by: 


y- lim ул Cf Y Ax 


i-l 
which in integral form is: 


b 
y- «| (FŒ)? dx (25.1) 


Let's apply (25.1) to the same objects used for computing the surface area of surfaces 
of revolution. 


25.2.1 Volume of a Cylinder 


The geometry required to compute the volume of a cylinder is shown in Fig. 25.2, 
where y = r is the radius, and h is the height. Therefore, using (25.1) we have: 


b 
vos | do as 


h 
=x | г? dx 
0 
h 
aan | 1 ах 
0 
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Fig. 25.2 Computing the yt 
volume of a cylinder 


25.2.2 Volume of a Right Cone 


The geometry required to compute the volume of a right cone is shown in Fig. 25.3, 
where у = rx/h. Therefore, using (25.1) we have: 


b 
7 "| FO ах 


Reversing the orientation of the cone as shown in Fig. 25.4, such that y = r(1 — 
x/h) we have: 


b 
vos | «озған 


h 5 x\2 
=л r (1 - >) dx 
0 h 
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Fig. 25.3 Computing the 
volume of a right cone 


Fig. 25.4 Reversing the 
orientation of a right cone 


We could have also integrated this as follows: 


b 
vee |. (РОЭ)? dx 
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Substituting: 
x 
= (ке 
£ h 
where du/dx = —1/h, or dx = —h du, and calculating new limits for и: [1, 0], we 
have: 


0 
У- ze f u?(—h) du 
1 


1 
= mh | и? du 
0 


1 1 
= лү?һ | — из | 
3 0 


= rh 
3 


25.2.3 Volume of a Right Conical Frustum 


Figure 25.5 shows the geometry to compute the volume of a right conical frustum, 
but this time the contour is rotated about the y-axis. The integral to achieve this is: 


b 
Veg | (РО) dy 


and the contour to be rotated about the y-axis is: 


х-(1- =) 


with the integral for the volume: 


h 2 
y 
v=x | 1-2) dy 
JM H 


However, in reality, we will not know the value of H, but we would know the values 
of rı and r2. Therefore, with a little manipulation, the contour can be written as: 


_ Arn У@2 — ri) 
h 
which confirms that when y = 0, x = rı, and when у = h, х = r2. Therefore, the 
volume can be written in terms of rj, r2 and Л as: 
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Fig. 25.5 Computing the y 


volume of a right conical 
frustum 


x= (1-2) 


h 
л 2 
у= dneym п)? ay 
0 


h 
--- hr? + 2hriy(r2 — т) + У (m ny dy 
0 


h 


1 
h?r2y + hriy (r2 ry) + 224 (ғ 2rir? + n) | 
0 


1 
( + hr (ra ry) + 5° (г; Әп + Э) 
(372 + Зи — 3r? + ry —2riro+ r?) 


For example, when 7; = 2 cm, r2 = 4 cm and h = 3 cm, then: 


V = (2+4 +8) = 280 en? 


25.2.4 Volume of a Sphere 
A sphere is easily created by rotating a semi-circle about the x- or y-axis, as shown 
in Fig. 25.6, where the equation of the contour is given by: 


y-mP-g 
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Fig. 25.6 A semi-circle y^ 
used to form a sphere 


25.2.5 Volume of an Ellipsoid 


Figure 25.7 shows part of an ellipse, which when rotated about the x-axis creates the 
volume of an ellipsoid. Using (25.1) with the equation for an ellipse: 


we have: 
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Fig. 25.7 Part of an ellipse A 
used to form an ellipsoid > 
b. 


| Lg 
-а P a 
Fig. 25.8 An ellipsoid 
where the ellipsoid’s volume is given by: 
a 
У=л f y! dx 
7 
b2 a 
=л— | (а – х?) dx 
а -а 


Figure 25.8 shows ап ellipsoid. 
Sweeping the ellipse about the y-axis creates another ellipsoid, with a different 
volume given by: 
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Observe that in both cases when а = b = г, the object is a sphere with a volume of 


4 3 
ЗЛГ 2 


25.2.6 Volume of a Paraboloid 


Figure 25.9 shows a parabola, which when rotated about the y-axis forms a 3D 
paraboloid 
To rotate about the y-axis the equation of the parabola is: 


х= 4/5 


where y є [0, А]. The volume of a paraboloid is: 


h 
ves x? dy 
0 


If x € [0, 1], then ^ = 1, and the volume is 7/2. Figure 25.10 shows a paraboloid. 


Fig. 25.9 A parabola, which у^ 
when rotated about the — 059Һ9ә4Һ9ҙ 5-2. А — 
y-axis creates a paraboloid N Р. 
\ y= x2 / 
“ РА 
ET P 
Rc. = we 
0 X 
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Fig. 25.10 А paraboloid 


25.3 Solid of Revolution: Shells 


A solid of revolution can also be constructed from a collection of concentric cylindri- 
cal shells as shown in Fig. 25.11, where the object’s shape is defined by the contour 
y = f (x) which is rotated about the y-axis. Figure 25.12 shows one of the cylindri- 
cal shells with a radius of x;, f (x;) high and Ax thick. As the shell is assumed to be 
infinitesimally thin, the volume of the shell is: 


V; = 2r x; f (xi) Ах 


Dividing the solid into и such shells, and letting n tend towards infinity, the entire 
volume is given by: 


V = lim У`2лх f (xi) Ax 


i-l 
which in integral form is: 


b 
yz 2л | x f(x) dx (25.2) 


Similarly, when the contour is rotated about the x-axis, the integral is: 


d 
У = 2л | у f(y) dy (25.3) 


Let's test (25.2) and (25.3) with various contours. 


25.3.1 Volume of a Cylinder 


Figure 25.13 shows the geometry to create a cylinder with radius r, and height h to 
be rotated about the y-axis. Using (25.2) the volume of a cylinder is: 
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Fig. 25.11 А series of 
concentric shells 


Fig. 25.12 Dimensions for 
one concentric shell 


Fig. 25.13 The geometry y 
used to create a cylinder 
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Fig. 25.14 The geometry 
used to create a right cone 


25.3.2 Volume of a Right Cone 


Figure 25.14 shows a straight line represented by y = А(1 — x/r), which when 
rotated about the y-axis sweeps out a right cone with radius r, and height h. The 
volume of a right cone is given by: 


vem s f(x) dx 
0 


d X 

=2 | xh (1-2) ах 
0 Г 
2 


Lond" 
= 2лһ | -x^ – = — 
2 3ғ Jo 
1 1 
= 2лһ (57 - i^) 
3 
1 
= ли?й 
3 


25.3.3 Volume of a Sphere 


Figure 25.15 shows the geometry to create a hemisphere with radius r to be rotated 
about the y-axis. As we have seen before, it is convenient to use polar coordinates 
when dealing with circles and spheres, therefore, our equations are: 


x-—rcos0 and y=rsiné 
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Fig. 25.15 The geometry yA 
used to create a hemisphere 


rsin@ 


0 тсов0 x 


The original interval for x is x е [0, r], which for 0 is 0 € [л/2, 0]. Therefore, 


dx 


10 =-rsind ог dx = -~-r sin 40 


Using (25.2) the volume of a hemisphere is: 


ү =2л | x f(x) ах 
0 
0 
=2л [ (r cos0 -r sin0 (—гзт0)) 40 
л/2 
0 
= -2лт° | cos 6 - sin? 0 40 
л/2 


0 
- am | cos 6 (1 — cos? 0) d0 
л/2 
0 0 
= -orr | соз Ө 40 + 2л? | cos? Ө 40 
л/2 л/2 
0 0 
= —2rr’ | sin 0 | өзге? | cos? 0 40 
л/2 л/2 
0 


= 2nr? + 2nr? | cos? 0 40 
л/2 


From Appendix В, we see that: 


3 1 : 2 2 2s 
cos Per аш (eus 8 + BI +C 
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Therefore: 


0 


w| = 


2 
sin Ө - cos? 0 + 1910 


V = 2лг? + 2л? | 
л/2 
2 


= 2nr? — 2nr3 
3 


which makes a sphere’s volume inr. 


25.3.4 Volume of a Paraboloid 


We have already seen that the volume of a paraboloid using y — x? is inh), where 


h is the height. The following shell method computes the volume surrounding the 
paraboloid, which using (25.2) gives: 


vom | x f(x) dx 
0 


and if x [0, 1], then h =r’, and У = izh. Which shows that the volume of 
inner paraboloid equals the enclosing volume. In order to compute the volume of a 
paraboloid using the shell technique, the parabola has to be inverted, as shown in 
Fig. 25.16. 


vom f x f(x)dx 
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Fig. 25.16 The geometry yA 
used to create a paraboloid 
hg 
yHh-x 
\ 
б 2 4 


But in our equation, h = r?, therefore: 


25.4 Volumes with Double Integrals 


Figure 25.17 illustrates a 3D function where z = f(x, y) over a region R defined 
by the limits a < x < b and c < y < d, whose area is projected onto the x y-plane. 
If we consider a small rectangular tile on the xy-plane with dimensions Ax and Ay, 
the volume of this column is approximately: 


ДУ ж f (xi, yj) Ax. Ay 


where i and j identify a specific tile. Therefore, the total volume is: 
Vm у, Ло, у)) Ах.Ау 
hj 
In the limit: 


V= lim f(x, y)Ax.Ay 
ij 


Ах, Ау->0 
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Fig. 25.17 А surface 
created by z = f(x, y) 


or in integral form: 


b d 
v=f І Jis йк 


where the inner integral is evaluated first, followed by the outer integral. The integral 
can be written in two ways: 


b pd d pb 
у= [ose yaxdy= | | ro. у) dy dx (25.4) 


Let’s apply (25.4) in various scenarios. 


25.4.1 Objects with a Rectangular Base 


25.4.2 Rectangular Box 


Figure 25.18 shows a rectangular box whose top surface is defined by z = h, with 
base dimensions (x2 — хі) and (y? — yi), where the enclosed volume is: 


У = (0 = х1) (у — y) 


Fig. 25.18 А rectangular 2 
box 


uh 
dada ) » Qa, ys, h) 


(х,у һ) Уә, h) 


у 
(ха, уз, 0) 


(X2, У, 0) 
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This is confirmed by (25.4) as follows: 


y2 х2 
= / | Pe shady 
У м 


y2 
= h(x2 — хі) 1 ау 
У 


У2 
= һ(0 — х1) E | 
У 


= h&a — xi)(y» — у) 


25.4.3 Rectangular Prism 


Figure 25.19 shows a rectangular prism whose top sloping surface is defined by 
z = h(1— x/a), with base dimensions a and b, where the enclosed volume is: 


1 
У = ^4 


This is confirmed by (25.4) as follows: 


y» X2 
- | | f(x, у) dx dy 


МЕ! ) ахау 
Ware 1-2) ) ахау 
о Jo a 
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Fig. 25.19 A prism 2 
| 
x 
2- h (1 == J | h 
| 
/^(0,b,0) У 
y 
y 
(a, 0,0) (a, b, 0) 


Fig. 25.20 An object with a 
curved top 
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25.4.4 Curved Тор 


Figure 25.20 shows an object with a square base and curved top defined by z = 
x? + у. Given that {x, у} е [0, 1], then Фе enclosed volume is: 
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y2 X? 
2- | / F(x, у) ах dy 
У м 


25.4.5 Objects with a Circular Base 


The same double integral works with polar coordinates, which enables us to compute 
the volume of objects with a circular base. We have already seen that when moving 
from Cartesian coordinates to polar coordinates, the appropriate Jacobian must be 
included. In this case, the following substitutions are: 


x = рсов0 
у =psing 
dx dy = p dp dé 


which transforms (25.4) into: 


b d 2л ғ 
= | І Жо, у) ах ay= | | f(pcos@, рѕіпӨ)р араб (25.5) 
а с 0 0 


Let’s test (25.5) using various objects. 


25.4.6 Cylinder 


The volume of a cylinder with radius r and f (p cos 6, psin@) = h is mr*h, which 
is confirmed as follows: 
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2л r 
ү =f f f(ocos0, рзтд)р dp 40 
о Jo 


2л г 
= | | hp ара 
0 0 


25.4.7 Truncated Cylinder 


The volume of a truncated cylinder is calculated by forming the intersection of a 
cylinder and an oblique plane. The following proof confirms that the volume equals 
лг?һ, because the cylinder’s height, Л, is the z-axis. To illustrate this, Fig. 25.21 
shows a side projection of a cylinder intersecting the plane: z = h — Ax/r, where 
à controls the slope of the plane. It is clear that the two cross-hatched triangles are 
equal, which is why the volume is unchanged: 


2л r 
ү ii І / (о соѕ0, рзтд)р dp dé 
о Jo 


ZIP fT Ар cos Ө 
= h — —— |04040 
0 0 r 
2л а Ap? cos 0 
І ph — ——— | dp dé 
0 Г 


1 
rh— QU cos ) dà 


1 2л 
- 22 | (3h — 24. соѕ0) 40 
6 Л 
1 А | 2л 
= -r 3h0 — 2А sin 0 
6 0 
1 
= gr бл! 


= rh 
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Fig. 25.21 Cross section of 
a cylinder and intersecting 
plane 


If the radius is 2, and the height 4, then the volume is 167r. Observe that the result 
is independent of A. Taking this cylinder and intersecting it with the parabola, z = 
2+ ix? as shown in Fig. 25.22, the volume reduces to 107: 


2л 1 2 
zi LE d0 
0 8 0 

4 


+ 2 cos? 0 40 


Fig. 25.22 А cross-section % 2А : 
of parabola intersecting a H 4 ҒА = 72/242 
cylinder 3 | 
2 
2 0 2 4 ы 
x 
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2л 
= f 5 + cos(20) 40 
0 


2л 


-| 50 + : sin(20) | 


= 10л 


0 


25.5 Volumes with Triple Integrals 


The double integral for calculating area is: 


ІШЕ y)dxdy or ІШЕ y)dA 
R 


R 


where the region R is divided into a matrix of small areas represented by dx dy or 
dA. The Riemann sum notation is: 


Jl f(x, y) dA = lim > Жо, yi) АА; 
noo & 
R і-і 
This notation can be generalised into a triple integral for calculating volume: 


[| f(x, у, z)dx dydz or Hi f(x, y, z) dV 
R R 


where the region R is divided into a matrix of small volumes represented by dx dy dz 
or dV. The Riemann sum notation is: 


Jf to y, z)dV = lim у РО, yi, zi) АУ 
noo i 
R = 


Let’s apply (25.6), where each integral identifies its interval of integration, to various 
3D objects and calculate their volume: 


b pd pf 
vef | | te. у, <) dx dy dz (25.6) 


25.5 Volumes with Triple Integrals 603 


Fig. 25.23 Cartesian 2 
coordinates for а rectangular 
box 


(x2, Ул, 22) (ха, Yz 22) 


(%2, V1, 22) ,у,%) 


y 
Gi yo 2) 
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25.5.1 Volume of a Rectangular Box 


Figure 25.23 shows the Cartesian coordinates for a rectangular box, with x-, y- and 
z-lengths are (x2 — x4), (y? — уі) and (z2 — zı) respectively, and whose volume is 
calculated using (25.6) as follows: 


b pd pf 
у= / Т f f(x, y, z)dx dy dz 
5 Ў ка x2 
= | f 1 dx dy dz 
21 У м 


Together, the three integrals create the product of three lengths: 


Х2-Х, Уи, 22-21 


which form the volume of the box: 


22 y2 х2 
У = f f | x | dy dz 
21 У м 
22 y 
=œ- f [rase 
21 » 
22 y 
= ам f В dz 
а я 


22 
= (x2 — х1)0>- » f 1 dz 
21 


22 


= (0 — х1) (2 — у) [1 


21 


= (x)—xi)» — у1) (2 — zi) 


which confirms that the volume is the product of the box's linear measurements. 
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Fig. 25.24 Тһе first 2% 
quadrant of a circular arc 


25.5.2 Volume of a Cylinder 


Figure 25.24 shows a quadrant of a cylinder with radius r, and height h. Its volume 
is computed by dividing the enclosed space into cuboids with a volume AV; — 
ôx - dy - óz. In the limit, as ёх, ду апа ôz tend towards zero, the entire volume is a 
Riemann sum, and a triple integral: 


h r м/у? 
V= І І І 1 dx dy dz (25.7) 
0 Јо JO 


The solution looks neater if the integrals are evaluated as follows: 


r руту ph 
vel f | 1 dz dx ау 
о Jo 0 
r Nm h 
ET В Ах dy 
о Jo 0 
=” | | 14х ау 
о Јо 


- Гро 


0 
x yr? — y? dy 
0 


Let y = r sin 0, then: 


F dy = r cos 40 
40 = r сов оғ y = F COS 
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and the interval for 0 is 0 € [0, 2/2], therefore: 


л/2 
У = | Vr2—r2sin? 0 r cos0 40 
0 
2/2 
= rn | cos? 0 40 
0 


1 л/2 
- rh | 1 + cos(20) 40 
2 0 


1 л/2 


1 
= sf | 0+ 5 sin(20) | 


= -лг?һ 
4 


As there are four such quadrants, the volume of a cylinder is z?/. 
Cartesian coordinates are not best suited for this work —it is much more convenient 
to employ cylindrical polar coordinates, where: 


х= осо$ф, y=psingd, 2-2 


and the Jacobian is o. Therefore, (25.7) is written to represent the entire volume as: 


h 2л r 
= | І f 0 dp аф dz 
0 0 0 


which is integrated as follows: 


2л 
[ p dp аф dz 


0 


2л F | r 
= dod 
j^ |, aa 


| 
nef [vee 
ЛО 
| 

| 
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Fig. 25.25 Spherical Polar 
Coordinates 


25.5.3 Volume of a Sphere 


Figure 25.25 shows how a sphere is defined using spherical polar coordinates, where 
any point has the coordinates (р, ф, 0). In order to compute the volume of a sphere, 
the following intervals apply: о € [0, r], Ф € [0, л], and 0 є [0, 27]. Using the 
Jacobian р? sin ф, the volume is: 


2л л г 
"= | [ І o? sing dp аф 40 
2л 
[ E | sin $ аф dé 
0 А 
И! | sin Ф аф d0 
0 
r xA | — совф I d0 
0 0 
2л 
p І 140 


> 
T 1 

S 
— 
АЕ: 


> 


шә WIN UIN wole ei 


ч 
3 


25.5.4 Volume of a Cone 


The triple integral provides another way to compute the volume of a cone, and is 
best evaluated using cylindrical polar coordinates, rather than Cartesian coordinates. 
Figure 25.26 shows an inverted cone with height л and radius ғ. The equation for 
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Fig. 25.26 A cone with 
cylindrical coordinates 


the cone is given by: 


h 
z= Nee 


where any point in the cone has a distance р = \/х? + y? from the z-axis. Thus when 
р =r,z = h, and when р = 0, z = 0, which provides the cone's shape. We are only 
interested in the volume between z = 0 and z = A. 

Thus the intervals for the three cylindrical coordinates are: ф є [0, 27], o € 
(0, r] andz € [р, h], and using the Jacobian p, the triple integral is: 


r 2л h 
V= | f І аф dz p dp 
0 J0 hp/r 
Integrating from the inside outwards, we have: 
r h 2л 
у = І f І аф dz p dp 
0 Jhp/r 40 
r h 2л 
= | f І аф dz p dp 
0 Jhp/r 40 
r h 2л 
“LET em 
0 Yhp/r 0 
r h 
=2л | І 1 dz p dp 
0 Jhp/r 
r h 
= 2л | | 2 | p dp 
0 hp/r 
2 һ 
2л | ( - te) p dp 
0 r 
r h 2 
an | (10 - ~~) ар 
0 ғ 
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25.6 Summary 


Integral Calculus is a powerful tool for computing volume, whether it be using 
single, double or triple integrals, and this chapter has covered four techniques using 
the following formulae. 


25.6.1 Summary of Formulae 
Slicing: Rotating f (x) about the x-axis 
b 
Van T (Sœ)? dx 
Slicing: Rotating f(y) about the y-axis 
b 
vos | (roy? ay 
Shells: Rotating f(x) about the x-axis 
b 
v=2 | x f(x) dx 


Shells: Rotating f (x) about the y-axis 


b 
vem y f(y) dy 


25.6 Summary 


Surface function f(x, y) using rectangular coordinates 


b d d b 
vef | ro »ахау= || | то asas 


Surface function f(x, y) using polar coordinates 


b ра Dmax max 
y= І Хо, »dxdy- | f(ocos0, psin0) р 40 dp 
a с 6, 


min тіп 


Triple integral using rectangular coordinates 


b pd pf 
у= | f | fev оаа 


Triple integral using cylindrical polar coordinates 


стах рФтах р Pmax 
"= | f f f(e, $, z) p dp dọ dz 


Zmin min min 
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Chapter 26 (Я) 
Fourier Series Check for 


26.1 Introduction 


This chapter shows how certain waveforms are approximated by a series of cosine 
and sine functions, called a Fourier Series. The idea was proposed by the French 
mathematician Joseph Fourier (1768-1830), who claimed that any function of a 
variable, whether continuous or discontinuous, can be expanded into a series of sines 
of multiples of the variable. (https://en.wikipedia.org/wiki/Joseph_Fourier) Today, 
this claim has been slightly tempered by mathematicians. 

This chapter is based upon my favourite book on mathematics Mathematics: From 
the Birth of Numbers (Gullberg 1997). 


26.2 Mathematical Foundations 


Fourier suggested that certain waveforms are represented by an infinite series of 
cosine and sine waves defined by: 


f(x) = D d 2 cos(nx) + УЬ, sin(nx) (26.1) 


n=1 


Equation (26.1) is often written as: 


ao = . 
Ро) = с: Ж аһ cos(nx) + b, sin(nx) (26.2) 
n=1 
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where: 


f (x) is a function for frequency f and is 
continuous over the interval [-л < x < л]. 
ао is a constant. 
n identifies the Fourier series term. 
а, is the amplitude of the cosine term. 


Б, is the amplitude of the sine term. 


To find an expression for ао we integrate f(x) in (26.2) over the interval [-л < 
x <л|: 


Г. f(x)dx = “/ ase Y (f a, osna) ах + | b sinn) ах) 


п=1 


= ES I | > [= sin(nx) ” созш) 


п=1 mi 


л 


ы аһ. 4 bn 
= адл >| x [sin(nz) — sin(—nz)] " [cos(nz ) опт] 


n=1 
20 [а b 
n п 
= аул ЭЕ [01— — о] 
п=1 
= ал 


1 л 
aj = = f(x)dx 


=F: 


We can find an expression for а, where n > 1, by first multiplying f(x) in (26.2) by 
cos(nx) and then integrating with respect to x over the interval [-л < x < л]. This 


results in: 
т 


а, = 1 f(x): cos(nx) dx 
Jon 


Similarly, we can find an expression for b, where n > 1, by first multiplying f(x) 
іп (26.2) by sin(nx) and then integrating with respect to x over the interval [-л < 
x < л]. This results in: 


п 


b, = с f(x) - sin(nx) dx 
T Ут 


26.3 Discontinuous Functions 613 


26.3 Discontinuous Functions 


A square wave is an excellent example of a discontinuous function, as shown in 
Fig. 26.1, where the function is 0 over the interval [-л < x < 0], and is 1 over the 
interval [0 < x < л]. The function is discontinuous at x = —л, х = 0 and x = л, 
and has a period of 2л. 

Over the interval [0 < x < л], the value of ag, а, and Б, are: 


1 T 
a f ldx=1 
7T Jo 
1 л 
a, = — 1. cos(nx) dx = 0 
T Jo 
1 f* : 
b, = = | 1 - sin(nx) dx 
T Jo 
1 
л 


2 
= — (for odd n) 
пл 


These values confirm that the series contains only a constant term, and sine terms: 


f (x) | | | | | x = Кл wherek е 2 


1 2 | sin(x) | sin(3x) | sin(5x) | sin(7x) | 
2 m, I 3 5 g-0 e 


Figure 26.2 shows this function by adding the extra terms. Figure 26.3 shows the 


: sin(23x) 
function up to ==. 


fo, 

о Ло 
о o» 
-2n -п 0 п 2n X 


Fig. 26.1 Sketch of a square wave 
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Fig. 26.2 Superimposed graphs with each colour showing an extra term in the series 


fx), 
| ЕШ | 
Sn Ei 0 " bx 


s ; : іп(23х) 
Fig. 26.3 The Fourier series up to 22777 


26.4 Even and Odd Functions 


As function products are involved in the definitions of ao, а, and b,, it helps develop 
a Fourier series if a function is described as even or odd over the interval [-л < x < 
л). Figure 26.4 shows an even function y = a(x), and an odd function у = b(x), 
with the areas marked +ve and —ve. Areas are often used for predicting the nature 
of a Fourier series. 

We know that an even function is defined by f(—x) — f(x), which makes the 
cosine function an even function. Conversely, ап odd function is defined by f (—x) = 
— f (x), which makes the sine function an odd function. Table 26.1 shows the product 
of two such functions, where we note that the product of two even functions, or two 


Fig. 26.4 Even and odd 
functions 
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Table 26.1 The product of even and odd functions 
Product Even | Odd 


Even Even Odd 
Odd Odd | Even 


odd functions, is an even function. The product of an even and odd function is an 
odd function. These results are quickly confirmed by noting that cos? x and sin? x 
are even functions, whilst cos(x) - sin(x) is odd. 


26.4.1 Even Functions 


We begin by assuming that f (x) is an even function over the interval [~x < x < л], 
and we will show that the Fourier series for f (x) only contains cosine terms. What 
follows is the values of ао, a, and Dy. 

Starting with: 


16) = D + У an cos(nx) + b, sin(nx) 


n=1 


then: 


As f (x) is an even function, the area of f (x) over the interval [~m < x < 0] equals 
the area of f (x) over the interval [0 < x < л], therefore: 


2 T 
ао = =] f(x)dx 
T Jo 


Similarly, a, is given by: 


а, = L 7 f(x) - cos(nx) dx 
л 


—л 


2 Г f (x) - cos(nx) dx 
T Jo 
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The last constant is b,, where: 


b, = Ж f(x) - sin(nx) dx 
л 


—л 


which contains the product of an even and odd function, and produces ап odd function. 
The integral of an odd function, with a period of 277, over the interval [-л < x < л] 
is zero, therefore: 

b, =0 


Thus, if f(x) is an even function, the Fourier terms are given by: 


Ғо- D + Уа, cos(nx) 


п=1 


26.4.2 Odd Functions 


Next, if f (x) is an odd function over the interval [-л < x < л], then we will show 
that the Fourier series for f(x) only contains sine terms. What follows is how we 
calculate ao, а, and by. 

Starting with: 


f@= D + Ya, cos(nx) + b, sin(nx) 


n=1 


then: ЛЕ. 
а = — f(x) ах 
Л Jin 


As f (x) is ап odd function, ће area of f (x) over the interval [-л < х < л] equals 
zero, then: 
ag = 0 
The value of a, is given by: 
1 л 
а, = — f (x): соѕ(их) dx 
л 


— 


and as f (x) is odd, and the cosine is even, the resulting function is odd. The integral 
of an odd function over the interval [-л < x < л] is zero, therefore: 


а, = 0 
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The last constant is b,, where: 


р, = E " Р(х) - sin(nx) dx 
л 


—" 


which contains the product of two odd functions, and produces an even function, 
therefore: 


b, — 1 " f(x) - sin(nx) dx 
л 


-7 


2 п 
= | f(x) - sin(nx) dx 
7T Jo 
Thus, if f (x) is an odd function, the Fourier terms are given by: 
оо 
Ро) = Y b, sin(nx) 
п=1 


We have shown that if f (x) is ап odd function, ће Fourier series only contains the 
sine terms. 


26.4.3 The Fourier Series for a Triangular Function 


A triangular function shown in Fig. 26.5 is described by: 


Fig. 26.5 А triangular fx) 
function ^ 
n 
0.5п 
-2п -п 0 п 2п > 
-0.5n 
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Р(х) = S + Уа, cos(nx) 


n=1 
2 л 
«== | f(x)dx 
л Jo 


а, = zf f (x): cos(nx) dx 
л Jo 
b, = 0 


The waveform has a period of 27 and an amplitude of т. As we are integrating over 
the interval [0 < x < л], we only have to describe f (x) over this interval, which is 
the straight line f(x) = x, or more accurately f (x) = |x|. 

The values of ао and a, are given by: 


2 T 
а = — | |х|ах 
л Jo 


=л 
2 T 
аи = — |x| - cos(nx) dx 
л Jo 


This integral is evaluated by parts where we use: 


л л 
І ш/4к-ш- | уи! dx 
0 0 
sin(nx) 


We begin by letting и = |x| and v’ = cos(nx), where и’ = 1 and v = 777 
Integrating by parts: 


2 Г , 2 [№ - f | 
- uv dx = — ои! dx 
T Jo T 
2 = ino 7 sin(nx) | 
ах 


| п 


2|х- T cos(nx) |" 
= + 5 
л п п б 
2 
= 0+ — · | cos(nx) 
лп? м 


= 0 (for even л) 


4 
=——— (for odd л) 
лп 
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Fig. 26.6 А triangular y 

function represented by 4 

Fourier terms ШОЛ 
3 
2 
1 

-2n -T 0 п 2n X 

-1 

Therefore: 

л 4 (сов(х) | cos(3x) | cos(5x) cos(7x) | 
о) = = ( p fw t ә 5 т e x = кл wherek € Z 


where we see that f (x) is composed entirely of odd cosine terms. Figure 26.6 shows 
this function containing 4 Fourier terms. 


26.5 Summary 


Hopefully, this short chapter is sufficient to show that certain functions can be repre- 
sented by a series of sine and/or cosine terms, and will inspire readers to learn more 
about this interesting topic. 


26.6 Worked Example 


26.6.1 Fourier Series for a Sawtooth Function 


Find the Fourier series for the sawtooth function shown in Fig. 26.7, where f (x) has 
a period of 2z and [-л <x < л]. 


Solution 


The sawtooth function f (x) is discontinuous and an odd function, therefore, 40 = 0, 
and a, = 0. 


f(x) = |х| 
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Fig. 26.7 A sawtooth 
function 
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b, 


п 


2 


п 


1 |x| - sin(nx) dx 
0 


This integral is evaluated by parts where we use: 


л л 
І иу dx = uv -f ои! dx 
0 0 


We begin by letting и = |x| and v' = sin(nx), where и’ = 1 and v = 


Integrating by parts: 
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Fig. 26.8 A sawtooth function represented by 8 Fourier terms 


where we see that f(x) is composed entirely of sine terms. Figure 26.8 shows this 
function containing 8 Fourier terms. 
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Appendix A 
Limit of (sin 0)/0 


This appendix proves that: 


. sing - : 
lim —— — 1, where is in radians 
9—0 


From high-school mathematics we know that sin 0 7 0, for small values of 0. For 
example, using radians: 


sin 0.1 ~ 0.099833 
sin 0.05 ~ 0.049979 
sin 0.01 ~ 0.009999 


and 


ле 0.99833 


sin 0.1 
0.1 


^ 0.99958 


sin 0.05 
0 


sin 0.01 
0.01 


= 0.99998 


Therefore, one can reason, that in the limit, as 0 — 0: 


. sind 
lim —— = 1 
020 60 


Figure А.1 shows a graph of (sin 0) /0, which confirms this result. However, this is 
an observation, rather than a proof. So, let's pursue a geometric line of reasoning. 

From Fig. A.2 we see as the circle's radius is unity, ОА = OB = 1, and AC = 
tan 0. As part of the strategy, we need to calculate the area of the triangle AO AB, 
the sector OAB and the AOAC 


© Springer Nature Switzerland AG 2024 623 
J. Vince, Foundation Mathematics for Computer Science, 
https://doi.org/10.1007/978-3-031-66549-3 


624 Appendix A: Limit of (sin 6) /0 


Fig. А.Л Graph of (sin 0)/0 yA 


Fig. A.2 Unit radius circle 
with trigonometric ratios 


Area of AOAB = AODB + ADAB 
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Area of AOAC = 209 tan 0 
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From the geometry of a circle, we know that: 


1. 1 1 
— 5100 < -9 < —tanO 
2 2 2 


: sin 0 
sind «60 < 


cos Ө 
0 1 
— < 
sin 0 cos 0 


sin 0 
1> кр > сов0 


1 < 


апа as Ө — 0, cos@ — 1 and = — 1. This holds, even for negative values of 0, 
because: 


sin(—0) = — sin _ sin 0 


Therefore: 


Appendix B 
Integrating cos” 0 


We start with: 


БЕ ах- [osx -cos’! x dx 


! x and у’ = cos x, then: 


Let и = cos" 
и! = —(n — 1) cos"? x · sin x 
and 


у = зах 


Integrating by parts: 


и’ ах = и | ас 


| cos”! x. cosx dx = cos" lx -sinx + f sinx- (n — 1) cos"? x . sinx dx +С 
= sin x cos" 7! x + (n — D f sin? x eos"? » dx+C 
E n-1 2 n-2 
= sinx- cos x + (n — 1) | (1 — соѕ x): cos xdx+C 


= sinx-cos?7! x + (n — 1) f ox? dx — (n — D fos dx +С 


n | cos" x dx = sin x - cos" ^ x + (n — 1) | cos"? dx +С 


> sin x · cos x п-і1 7-2 
cos" x dx = | cos dx +С 
п п 


where n is an integer, Æ 0. 
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Similarly: 
:AQn-l 
2 cosx sin" x п—1 DASS 
ES х ах = + ES dx + C 


n n 


For example: 


я 1. 2 2. 
cos TEE ашу ON Feo. sae 
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Jacobian, 565 
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transform, 363 
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3D 
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